Description

EPIDIDYMAL LIPOCALIN GENE AND USES THEREOF 

Cross Reference to Related Applications 
This application is based on and claims priority to United States
Provisional Patent Application Serial No. 60/258,655, filed December 29, 2000,
the entire contents of which are herein incorporated by reference.

Grant Statement 

This work was supported by NICHD grant HD36900. Thus, the U.S. 
Government has rights in the invention. 

Field of the Invention

The present invention generally relates to epididymal function and male
fertility. More particularly, the present invention provides lipocalin nucleic acid
and polypeptide sequences, a lipocalin gene promoter region that directs gene
expression in the epididymis, chimeric genes comprising disclosed lipocalin
sequences, and uses thereof.






Table of Abbreviations

CAT         chloramphenicol acetyl transferase
ES          embryonic stem cell
FCS         fluorescence correlation spectroscopy
hEP17       human Epididymal Protein 17
mE-RABP     mouse Epididymal Retinoic Acid Binding Protein
mEP17       mouse Epididymal Protein 17
MS          mass spectroscopy
PCR         polymerase chain reaction
PGK         phosphoglycerate kinase
pLN-17      mEP17 targeting vector for homologous recombination
RAR         Retinoic Acid Receptor
RT-PCR      reverse transcription polymerase chain reaction
SELDI       surface-enhanced laser desorption/ionization
SPR         surface plasmon resonance
TOF         time of flight mass spectroscopy



Background Art 

Recent studies of reproductive frequency in the United States report that
7% of married couples (greater than 2 million couples) describe difficulty in
achieving a pregnancy. See Fidler and Bernstein (1999) Public Health Reports
114:494-511. Many individuals now seek medical support for conception,
including infertility diagnosis and assisted reproductive treatment. The
monetary costs for such services are substantial, and financial commitment
must increase to include pre- and post-natal care of multiple birth pregnancies
often associated with infertility treatment. A tumult of legal and ethical issues
has emerged regarding the rights of parents and unborn children that are
conceived by an unconventional method. The escalating magnitude of
monetary, legal, and ethical concerns when considering pregnancy has
established infertility as a significant public health issue.

Rational treatment approaches for many andrological disorders resulting
in infertility are still lacking. See Kamischke and Nieschlag (1999) Human
Reproduction 14(Suppl. 1):1-23. The cause of male infertility is often
unidentifiable, referred to generally as "idiopathic infertility", or the presumed
pathology is not yet met with an unequivocal therapy. Intracytoplasmic sperm
injection has been a successful method for enabling fertilization in many cases,
yet a less interventive treatment is still sought. The historical use of
approaches now deemed ineffective emphasizes the importance of thorough
studies during early stages of therapy development. In particular, there is a
need for more sophisticated diagnostic tools that detect molecular bases of
male infertility and for non-surgical therapies that are supported by solid
physiological data. An initial effort in this regard is the development of animal
models of male infertility.

A related medical incentive is the development of new methods for
contraception. See Baird and Glasier (1999) BMJ 319:969-972. The
prevalence of contraceptive use is increasing worldwide; however, existing
contraceptive means are limited by adverse side effects, inconvenience, and
remaining instances of ineffectiveness. In particular, there are presently no
safe and reversible means for male contraception. One strategy that has been
explored recently is an immunological approach for disrupting endocrine or
physiological events that normally promote pregnancy. Vaccines that comprise
antigens of sperm plasma membrane proteins, zona pellucida proteins of the
egg, or gonadotropin releasing hormone have shown success in suppressing
fertility when administered to several mammalian subjects, including humans.
See U.S. Patent Nos. 6,096,318 and 6,132,270; Barber and Fayrer-Hosken
(2000) J Reprod Immunol 46:103-124; Paterson et al. (2000) Cells Tissues
Organs 166:228-232; Srivastav (2000) J Reprod Fertil 119:241-252; Feng et
al. (1999) J Reprod Med 44:759-765; Naz (1999) Immunol Rev 171:193-202;
Talwar (1999) Immunol Rev 171:173-192.

Although animal use and clinical trials of immunocontraceptive vaccines
are encouraging, existing vaccines present significant complications, for
example, auto-immune reactions. Thus, current research is focused on
identifying new antigens that can provide safer vaccines. Animal models with
implications for post-testicular human male contraceptives acting at the
epididymis offer promising leads. See Nikkanen et al. (2000) Contraception
61:401-406; Cooper and Yeung (1999) Hum Reprod. To
establish molecular targets for vaccine and drug development, proteins that are
essential for epididymal function have been identified (Srivastav, 2000;
Diekman et al. (1999) Immunol Rev 171:203-211; Costa et al. (1997) Biol
Reprod 56:985-990; Sonnenberg-Riethmacher et al. (1996) Genes Dev
10:1184-1193).
During their transit through the epididymis, spermatozoa undergo
biochemical and morphological changes to acquire motility and the ability to
fertilize an oocyte in vivo. The maturation process occurs progressively along
the epididymal duct and is believed to depend on epididymal secretory
proteins. The epididymal epithelial cells secrete proteins in a highly regulated
and regionalized manner such that spermatozoa encounter luminal fluid protein
in a specific sequence. Indeed, each region within the epididymis is a unique
microenvironment adapted with a characteristic milieu of ions, organic solutes,
proteins, and steroids. See Cornwall et al. (2001) in "The Epididymis", Plenum
Press. Spatially-restricted gene expression as well as regional differences in
cellular morphology define three distinctive regions of the epididymis known as
the caput, corpus, and cauda. These structural subdivisions and highly
regionalized gene expression therein are observed in the epididymis of several
organisms, including humans (Krull et al. (1993) Mol Reprod Dev 34:16-34).

Regionalization of the epididymis likely fulfills an essential and
cumulative role in the maturation and survival of spermatozoa. In support
thereof, targeted mutation of the mouse c-ros tyrosine kinase receptor confers
male sterility, although sperm production is not affected. c-ros is normally
expressed in the initial segment of the epididymis, and animals lacking c-ros
function show specific underdevelopment and lack of cellular differentiation
within the initial segment (Sonnenberg-Riethmacher et al., 1996). Sperm taken
from a c-ros mutant mouse are less motile due to flagellar angulation,
suggesting that failure of differentiation of epithelial cells in one segment of the
epididymis can affect sperm maturation and survival (Yeung et al. (1999) Biol
Reprod 61:1062-1069).

To generate regionalization within the epididymal epithelium, gene
expression is precisely controlled, in part through transcriptional regulation.
Transcription factors modulate transcription by binding DNA cis-regulatory
sequences, often located upstream of the gene promoter and
transcription start site, and by concomitantly affecting assembly of cellular
transcriptional machinery at the relevant promoter. Therefore, cis-regulatory
sequences of epididymal-specific genes and the transcription factors that are
operative through these sites are important elements in understanding
epididymal function as it contributes to sperm maturation. Current approaches
to identify mechanisms involved in region-specific gene expression in the
epididymis have been limited by a lack of identified transcriptional regulatory
proteins which are key to this process. See Cornwall et al. (2001).

Candidate regulators include components of retinoid signaling pathways.
Most elements known to be involved in retinoid signaling are present in the
epididymis, including epididymal retinoic acid binding protein (mE-RABP),
cellular retinol-binding protein type I (CRBP I), cellular retinoic acid binding
protein type I (CRABP I), retinoic acid receptor alpha (RARα), retinoic acid, and
retinyl esters. Moreover, studies addressing the function of such elements
emphasize the important role of retinoid signaling pathways in epididymal
integrity. In retinoid deficient animals, there is widespread squamous
metaplasia and keratinization of the epididymal epithelium (Wolbach (1925) J
Exp Med 42:753-777), and abnormal synthesis and secretion of several
epididymal proteins (Astraudo et al. (1995) Arch Androl 35:247-259). Similarly,
overexpression of a dominant negative form of RARα leads to disorganization
of the epididymal epithelium and concomitant infertility (Costa et al., 1997). In
a related study, RARα knockout mice display aspermatogenesis and
vacuolization of the epididymal epithelium (Lufkin et al. (1993) Proc Natl Acad
Sci USA 90:7225-7229), and animals lacking both RARα and RARγ function
show epididymal dysplasia (Mendelsohn et al. (1994) Development 120:2749-
2771).

The mE-RABP protein is of particular interest among regulators of
retinoid signaling, as it appears to be expressed selectively in the mid and
distal caput of the epididymis. mE-RABP is a member of a family of secreted
lipocalin proteins. Structural analyses reveal that lipocalins comprise an
eight-stranded β barrel that is closed at one end by an α-helical turn, thereby
forming a hydrophobic binding cavity. This hydrophobic pocket is well-adapted
for noncovalent binding and transport of small lipophilic ligands. mE-RABP binds
active retinoids (9-cis and all-trans retinoic acid), and functions as a retinoid
carrier protein in the epididymis. See Ong et al. (2000) Biochim Biophys Acta
1482(1-2):209-17.

Recent studies by the co-inventors of the present application have
identified a similar gene encoding a 17 kDa lipocalin, Mouse Epididymal Protein
of 17 kDa (mEP17) (Lareyre et al. (2001) Endocrinology 142:1296-1306).
mEP17 and mE-RABP are significantly related by several measures that
collectively suggest mEP17 also functions as a regulator of retinoid signaling
in the epididymis. First, mE-RABP and mEP17 are positioned adjacent to each
other on mouse chromosome 2. Exon/intron boundaries are strictly conserved
between mE-RABP and mEP17, supporting that these genes arose by gene
duplication. Second, mEP17 shows regionalized expression in the epididymis.
mEP17 expression is limited to the initial segment of the caput epididymis,
while mE-RABP is expressed in the adjacent mid and distal caput epididymis.
Third, the mEP17 protein contains two motifs (G-X-W and T-D-Y) and two
cysteine residues that are characteristic features shared by members of the
lipocalin protein family. With the exception of these motifs, mEP17 shows low
sequence similarity with other known lipocalins. However, it is well established
that lipocalin family members do not show significant sequence homology
(average 25% identity and 50% homology between representative members).
Rather, lipocalins are more clearly related by assessing homology of secondary
and tertiary structure. The tryptophan residue of the G-X-W motif is required
for binding of lipophilic ligands, and the two cysteine residues form an
intramolecular disulfide bond that influences ligand affinity. In addition, a
putative signal sequence at the amino terminus of the mEP17 precursor
suggests that it is cleaved to generate a mature secreted protein, consistent
with its identification as a lipocalin. These structural similarities between
mEP17 and other lipocalins, most significantly mE-RABP, suggest that mEP17
is also a carrier for retinoid ligands.
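
By way of illustration only, the lipocalin signature features discussed above (a G-X-W motif, a T-D-Y motif, and cysteine residues) can be located in an amino acid sequence with a simple scan. The following Python sketch is not part of the disclosed methods; the function name and the toy sequence are hypothetical, and the toy sequence is not an EP17 sequence.

```python
import re


def lipocalin_features(protein: str) -> dict:
    """Return 1-based positions of G-X-W motifs, T-D-Y motifs, and cysteines.

    Illustrative helper (hypothetical name); X in G-X-W is any residue.
    """
    seq = protein.upper()
    # A lookahead is used so that overlapping motif occurrences are all reported.
    gxw = [m.start() + 1 for m in re.finditer(r"(?=G.W)", seq)]
    tdy = [m.start() + 1 for m in re.finditer(r"(?=TDY)", seq)]
    cys = [i + 1 for i, residue in enumerate(seq) if residue == "C"]
    return {"GXW": gxw, "TDY": tdy, "Cys": cys}


# Hypothetical toy sequence used only to exercise the function.
toy_sequence = "MKALGSWHAVCTDYEKLNCQ"
print(lipocalin_features(toy_sequence))
```

Such a scan reports only the presence and position of the motifs; it does not assess the secondary or tertiary structure by which lipocalins are more clearly related.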

The present invention relates to a current challenge in developing
animal models of infertility, male fertility treatments, and male contraceptives.
To this end, the present invention provides an isolated promoter region of the
mEP17 gene, an isolated nucleic acid molecule encoding a human mEP17
gene (hEP17), an isolated promoter region of hEP17, and chimeric genes
comprising the disclosed sequences. Host cells expressing a recombinant
EP17 gene or an mEP17 promoter region operably linked to a reporter gene
sequence are useful in screening assays for discovery of substances that
modulate EP17. A chimeric gene comprising an mEP17 promoter region can
also be used to direct transcription of a heterologous nucleotide sequence in
the epididymis of a host organism. The present invention further provides an
EP17 polypeptide that can be used for vaccine or drug development. By
provision of epididymal lipocalin nucleotide and polypeptide sequences, and
methods for using the same, the present invention meets a long-felt need for
advancement in fertility research.

Summary of the Invention 
The present invention provides an isolated promoter region of an EP17
gene that reconstitutes endogenous expression in epididymis. In one preferred
embodiment, a promoter region of the invention comprises a 5.3 kb fragment
(GenBank Accession No. AF08222) of mouse genomic clone 10983 (Genome
Systems, Inc.) between the EcoRV and SalI restriction sites, or functional
portion thereof. More preferably, the functional portion of the promoter region
comprises a TATA box and at least one cis-acting regulatory sequence
selected from the group including but not limited to a Sp-1 binding site, an AP-1
binding site, a retinoic acid receptor binding site, an androgen receptor binding
site, a c-Ets binding site, a SRY binding site, an AP-4 binding site, a C/EBP
binding site, and combinations thereof. Most preferably, an isolated promoter
region of the present invention comprises the nucleotide sequence of SEQ ID
NO:1, a nucleic acid molecule substantially identical to SEQ ID NO:1, or a 20
base pair nucleotide sequence identical to a contiguous 20 base pair
nucleotide portion of SEQ ID NO:1.
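
For illustration, whether a candidate sequence contains a 20 base pair stretch identical to a contiguous 20 base pair portion of a reference sequence can be checked computationally. The Python sketch below is a minimal example under stated assumptions: the function name is hypothetical, the sequences are placeholders rather than SEQ ID NO:1, and only the strand as written is compared (the reverse complement is not considered).

```python
def shares_contiguous_kmer(candidate: str, reference: str, k: int = 20) -> bool:
    """Return True if candidate and reference share an identical run of k bases.

    Illustrative helper (hypothetical name). Comparison is case-insensitive
    and limited to the strand as written.
    """
    cand, ref = candidate.upper(), reference.upper()
    if len(cand) < k or len(ref) < k:
        return False
    # Index every k-base window of the reference for constant-time lookup.
    ref_windows = {ref[i:i + k] for i in range(len(ref) - k + 1)}
    return any(cand[i:i + k] in ref_windows for i in range(len(cand) - k + 1))


# Placeholder sequences for demonstration only.
reference = "ACGT" * 10
candidate = "TTTT" + reference[5:25] + "GGGG"
print(shares_contiguous_kmer(candidate, reference))
```

The window length is left as a parameter so the same check can be applied to other lengths; it defaults to the 20 base pairs recited above.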

The present invention also provides a human EP17 gene. Preferably,
the human EP17 gene comprises the sequence set forth as SEQ ID NO:2, a
nucleic acid molecule that is substantially similar to SEQ ID NO:2, or a nucleic
acid molecule comprising a 20 base pair nucleotide sequence that is identical
to a contiguous 20 base pair sequence of SEQ ID NO:2. The present
invention further provides an isolated promoter region derived from a human
EP17 gene. In this case, a hEP17 promoter region is preferably an about 5160
base pair region immediately upstream of the human EP17 transcription start
site. More preferably, an isolated promoter region of the present invention
comprises a TATA box and at least one cis-acting regulatory sequence
selected from the group including but not limited to a Sp-1 binding site, an AP-1
binding site, a cAMP response element binding protein (CREB) binding site, a
SRY-related HMG-box gene 5 (Sox5) binding site, a Sex-determining region
Y gene product (SRY) binding site, a c-Ets binding site, a GATA binding site,
an Octamer transcription factor 1 (Oct-1) binding site, and combinations
thereof. Most preferably, an isolated promoter of the present invention
comprises the nucleotide sequence of SEQ ID NO:5, a nucleic acid molecule
substantially identical to SEQ ID NO:5, or a 20 base pair nucleotide sequence
identical to a contiguous 20 base pair nucleotide portion of SEQ ID NO:5.

The present invention further provides a chimeric gene comprising an
EP17 promoter region operably linked to a heterologous nucleotide sequence.
Preferably, the EP17 promoter region comprises the nucleic acid molecule of
SEQ ID NOs:1 or 5, or functional portion thereof. In a preferred embodiment,
a chimeric gene of the invention is carried in a vector and expressed in a host
cell including but not limited to a bacterial cell, a hamster cell, a mouse cell, or
a human cell.



The present invention also provides a transgenic animal having a
transgene that comprises a chimeric gene of the present invention. In a
preferred embodiment, expression of the chimeric gene alters fertility of the
host animal.

The present invention also provides a method for identifying a substance
that regulates EP17 expression using a chimeric gene that includes an isolated
EP17 promoter region operably linked to a reporter gene. According to this
method, a gene expression system is established that includes the chimeric
gene and components required for gene transcription and translation so that
reporter gene expression is assayable. To select a substance that regulates
EP17 expression, the method further provides the steps of using the gene
expression system to determine a baseline level of reporter gene expression
in the absence of a candidate regulator, providing a plurality of candidate
regulators to the gene expression system, and assaying a level of reporter
gene expression in the presence of a candidate regulator. A candidate
regulator is selected whose presence results in an altered level of reporter
gene expression when compared to the baseline level. Preferably, the isolated
EP17 promoter region used in this method comprises the sequence of SEQ ID
NOs:1 or 5, or functional portion thereof.
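
The selection step of the screening method described above (comparing reporter gene expression in the presence of each candidate regulator to the baseline level) can be sketched as follows. This Python example is illustrative only. The function name, the candidate labels, the reporter readings, and the two-fold threshold used to call a level "altered" are all assumptions made for the example and are not specified by the present disclosure.

```python
def select_regulators(baseline, readings, min_fold=2.0):
    """Return candidates whose reporter level differs from baseline.

    baseline: reporter level measured without any candidate regulator.
    readings: mapping of candidate name to reporter level with that candidate.
    min_fold: assumed fold-change cutoff (increase or decrease) for selection.
    """
    if baseline <= 0:
        raise ValueError("baseline reporter level must be positive")
    selected = {}
    for name, level in readings.items():
        fold = level / baseline
        # Keep candidates that raise or lower reporter expression enough.
        if fold >= min_fold or fold <= 1.0 / min_fold:
            selected[name] = fold
    return selected


# Hypothetical reporter readings in arbitrary units.
hits = select_regulators(100.0, {"cmpd_A": 250.0, "cmpd_B": 110.0, "cmpd_C": 40.0})
print(hits)
```

In practice the cutoff would be chosen with reference to assay variability, for example from replicate baseline measurements.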

In another aspect of the invention, a method is provided for producing
an epididymal cell line using a chimeric gene comprising an EP17 promoter
operably linked to a gene encoding a selectable marker. According to the
method, a transgenic animal is generated that expresses a selectable marker
gene. Preferably, the selectable marker gene is an antibiotic resistance gene.
More preferably, the antibiotic resistance gene is a neomycin resistance gene.
Epididymal cells are procured from the transgenic animal and stably
reproduced in cell culture using selection of the marker gene. Preferably, the
EP17 promoter region used to perform this method is the nucleic acid molecule
of SEQ ID NO:1, or functional portion thereof.

Another aspect of the present invention pertains to a method for
mutagenizing an EP17 locus by homologous recombination. The method uses
a targeting vector having an isolated EP17 promoter region, a marker gene,
and an isolated EP17 3' flanking region. In a vector so constructed, the marker
gene is positioned between the promoter region and the 3' flanking region. In
one embodiment, the targeting vector further comprises a mutant EP17 coding
sequence, also positioned between the promoter region and the 3' flanking
region. The targeting vector is linearized by digestion with a restriction
endonuclease at a site other than within the promoter region, marker gene, 3'
flanking region, and optional mutant EP17 coding sequence. The linearized
vector is introduced into embryonic stem cells and is assayed by detecting the
marker gene in the stem cells. Stem cells bearing the vector are used to create
a transgenic vertebrate animal. According to the method, a homologous
recombination event is mediated at the EP17 locus, thereby exchanging native
mEP17 gene sequences positioned between the promoter region and the 3'
flanking region with vector nucleotide sequences positioned the same. In a
more preferred embodiment, male EP17 mutant animals produced by the
disclosed method are sterile.

The present invention also discloses a human EP17 polypeptide and an
isolated nucleic acid sequence encoding the same. Preferably, an isolated
EP17 polypeptide, or functional portion thereof, comprises a polypeptide
encoded by the nucleic acid molecule of SEQ ID NO:3, a polypeptide encoded
by a nucleic acid molecule that is substantially identical to SEQ ID NO:3, a
polypeptide fragment encoded by a 20 nucleotide sequence that is identical to
a contiguous 20 nucleotide sequence of SEQ ID NO:3, a polypeptide having
an amino acid sequence of SEQ ID NO:4, a polypeptide that is a biological
equivalent of the polypeptide of SEQ ID NO:4, or a polypeptide that is
immunologically cross-reactive with an antibody that shows specific binding
with a polypeptide comprising some or all amino acids of SEQ ID NO:4.
Preferably, the polypeptide of the present invention comprises a human EP17
polypeptide.

The present invention further teaches chimeric genes having a
heterologous promoter that drives expression of a nucleic acid sequence
encoding an EP17 polypeptide. Preferably, the chimeric gene is carried in a
vector and introduced into a host cell so that an EP17 polypeptide of the
present invention is produced. Preferred host cells include but are not limited
to a bacterial cell, a hamster cell, a mouse cell, or a human cell.

In another aspect of the invention, a method is provided for detecting a
nucleic acid molecule that encodes an EP17 polypeptide. According to the
method, a biological sample having nucleic acid material is hybridized under
stringent hybridization conditions to an EP17 nucleic acid molecule of the
present invention. Such hybridization enables a nucleic acid molecule of the
biological sample and the EP17 nucleic acid molecule to form a detectable
duplex structure. Preferably, the EP17 nucleic acid molecule includes some
or all nucleotides of SEQ ID NOs:1, 2, 3, or 5. Also preferably, the biological
sample comprises human nucleic acid material.

The present invention further teaches an antibody that specifically
recognizes an EP17 polypeptide. Preferably, the antibody recognizes some
or all amino acids of SEQ ID NO:4. A method for producing an EP17 antibody
is also disclosed, and the method comprises recombinantly or synthetically
producing an EP17 polypeptide, or portion thereof; formulating the EP17
polypeptide so that it is an effective immunogen; immunizing an animal with the
formulated polypeptide to generate an immune response that includes
production of EP17 antibodies; and collecting blood serum from the immunized
animal containing antibodies that specifically recognize an EP17 polypeptide.
Preferably, the EP17 polypeptide used as an immunogen includes some or all
amino acid sequences of SEQ ID NO:4.

A method is also provided for detecting a level of EP17 polypeptide
using an antibody that specifically recognizes an EP17 polypeptide. According
to the method, a biological sample is obtained from an experimental subject
and a control subject, and EP17 polypeptide is detected in the sample by
immunochemical reaction with the EP17 antibody. Preferably, the antibody
recognizes amino acids of SEQ ID NO:4 and is prepared according to a
method of the present invention for producing such an antibody.

The present invention further discloses a method for identifying a
compound that modulates EP17 function. The method comprises exposing
an isolated EP17 polypeptide to a plurality of compounds, and assaying binding
of a compound to the isolated EP17 polypeptide. A compound is selected that
demonstrates specific binding to the isolated EP17 polypeptide. Preferably, the
EP17 polypeptide used in the binding assay of the method includes some or
all amino acids of SEQ ID NO:4.

The present invention further provides a method for modulating EP17
function in a subject. According to the method, a pharmaceutical composition
is prepared that includes a substance capable of modulating EP17 expression
or function, and a carrier. An effective dose of the pharmaceutical composition
is administered to a subject, whereby EP17 activity is altered in the subject. In
a preferred embodiment, the substance used to perform this method shows
specific binding to some or all amino acids of SEQ ID NO:4 and was
discovered by a screening assay method of the present invention. In another
embodiment, EP17 function is disrupted by immunizing a subject with an
effective dose of the disclosed EP17 polypeptide. The immune system of the
subject produces an antibody that specifically recognizes the EP17
polypeptide, and preferably recognizes some or all of the amino acids of SEQ ID
NO:4. In a further embodiment, a gene therapy vector is used, the vector
comprising a nucleotide sequence encoding an EP17 polypeptide.
Alternatively, the gene therapy vector comprises a nucleotide sequence
encoding a nucleic acid molecule, a peptide, or a protein that interacts with an
EP17 nucleic acid or polypeptide. Preferably, the subject is a human subject.

A method is also provided for expressing a nucleotide sequence of
interest in epididymis using an EP17 promoter region. According to the
method, a gene therapy vector is prepared comprising an EP17 promoter
region operably linked to a nucleotide sequence of interest. A gene therapy
vector so constructed is administered to a subject, whereby the nucleotide
sequence of interest is expressed in epididymis. Preferably, the EP17
promoter comprises SEQ ID NO:5, or functional portion thereof. Also preferably,
the subject is a human subject.

The invention further provides a method for diminishing the fertile
capacity of a subject. According to the method, a chemical compound, peptide,
or antibody that interacts with an EP17 polypeptide is identified. Preferably, the
polypeptide is the sequence of SEQ ID NO:4 or 6. A pharmaceutical
composition is prepared comprising such a chemical compound, peptide, or
antibody, and a carrier. An effective dose of the pharmaceutical composition
is administered to a subject, whereby the fertile capacity of the subject is
diminished.

The invention further provides a method for promoting the fertile capacity
of a subject. In this case, a chemical compound or peptide that interacts with
an EP17 polypeptide is identified. Preferably, the polypeptide is the sequence
of SEQ ID NO:4 or 6. A pharmaceutical composition comprising the chemical
compound or peptide and a carrier is prepared. An effective dose of the
pharmaceutical composition is administered to a subject, whereby the fertile
capacity of the subject is improved.

Accordingly, it is an object of the present invention to provide novel
EP17 nucleic acid and polypeptide sequences, and novel methods relating
thereto. This object is achieved in whole or in part by the present invention.

An object of the invention having been stated above, other objects and
advantages of the present invention will become apparent to those skilled in
the art after a study of the following description of the invention, Figures, and
non-limiting Examples.

Brief Description of the Drawings 
Figure 1 depicts genomic organization of the mEP17 gene. mEP17 is
located upstream from mE-RABP within the locus [A3,B] of mouse
chromosome 2. Exon sizes are indicated in nucleotides. The major
transcription initiation sites of both genes are represented with broken arrows.
Primer "mEP17cDNA" (SEQ ID NO:7) was used for primer extension
analysis. Two motifs G-X-W and T-D-Y and two cysteine residues (C) that
contribute to the three dimensional structure of lipocalin proteins are also
indicated.

Figure 2A presents a Northern blot showing epididymis-specific
expression of the mEP17 gene. Total RNA was extracted from individual
tissues and hybridized with [32P]-labeled mEP17 cDNA. Two major transcripts
of 1 kb and 3.1 kb in size were detected only in the epididymis.

Figure 2B shows Northern blot analysis of total RNA extracted from the
epididymis, hybridized with [32P]-labeled intron 1 of the mEP17 gene or with
[32P]-labeled mEP17 cDNA used as probes. The intron 1 probe only detected
the 3.1 kb transcripts, suggesting that these transcripts are likely unspliced
mEP17 precursor RNA.

Figure 3 shows region-specific expression of the mEP17 gene in the
initial segment of the epididymis. In situ hybridization of mEP17 transcripts is
detected in the initial segment (IS) but not in the efferent duct (ED) and
mid/distal caput epididymis (Cp).

Figures 4A and 4B show in situ hybridization of mEP17 in epididymal
tissue, and also show cell-specific expression of the mEP17 gene.

Figure 4A shows a high magnification view of the boxed region of Figure
3 at the boundary between the initial segment (IS) and proximal caput
epididymis (Cp). mEP17 mRNA is highly expressed only in the principal cells of
the initial segment (IS). No staining is observed in the conjunctive tissue (CT)
and in the epithelial cells of the proximal caput epididymis (Cp).

Figure 4B shows hybridization of a section of the initial segment with a
sense strand digoxygenin-labeled mEP17 RNA. No signal is detected.

Figure 5 presents a comparison of the genomic structure of the mouse
and human EP17 genes. The major transcription initiation sites (TIS) of both
genes are indicated by broken arrows. The lipocalin-specific motifs (G-X-W,
T-D-Y, and 2 cysteine residues) are also indicated. Black boxes indicate
exons, and the line regions between the boxes indicate introns. Numbers below
the boxes and line regions indicate exon and intron sizes in base pairs.

Figure 6 presents BLAST results using human EP17 cDNA sequence
of SEQ ID NO:3 as the query sequence. The highest homologies were
observed with other epididymal lipocalins (E-RABP and prostaglandin H2-D
isomerases).

Figure 7 presents a comparison of the amino acid sequences of mouse
EP17 and human EP17 proteins. Conserved lipocalin motifs are indicated.
The mouse and human EP17 proteins share 61% overall identity.

Figure 8 shows hydropathic analysis of the murine and human EP17
proteins.

Figure 9 depicts a restriction enzyme map of plasmids derived from BAC
clone 10983 aligned with the mEP17/mE-RABP genomic region. The promoter
region of mEP17, mEP17 exons, the intergenic region, and mE-RABP exons
are indicated.

Figure 10 shows primer extension analysis of the 5' end of mEP17
mRNA. Total RNA extracted from the epididymis (Ep) or transfer (t) RNA was
reverse transcribed with [32P]-radiolabeled mEP17PE2 primer (SEQ ID NO:7)
and extended using Avian Myeloblastosis Virus (AMV) reverse transcriptase.
Lanes labeled "C", "T", "A" and "G" are [35S]-labeled sequencing
reactions carried out using the mEP17PE2 primer (SEQ ID NO:7) and the
pHindIII clone (indicated in Figure 9) as template. The localization of two major
(arrows) and two minor (arrowheads) transcription initiation sites is indicated.

Figure 11 shows the nucleotide sequence of the mEP17 5.3 kb promoter
region. Putative cis-DNA regulatory elements within the 5' flanking region are
underlined, including binding sites for androgen receptor (ARSB), retinoic acid
receptor (RARE), Stimulating Protein 1 (SP-1), Activator Protein 1 (AP-1),
Octamer transcription factor 1 (Oct-1), and Sox-5 (SRY-related Sequence #5
Protein). A consensus TATA box is indicated.

Figure 12 depicts constructs used in functional assays of the mEP17 
promoter, each construct comprising a different fragment of the 5.3 kb mEP17
promoter region (solid lines), an open reading frame encoding an exemplary
reporter gene (chloramphenicol acetyltransferase, solid bar labeled "CAT")
operably linked to a promoter fragment, and the polyA tail region of Simian 
virus 40 large T antigen. 

Figure 13 shows the nucleotide sequence of the human EP17 promoter
region. Putative cis-DNA regulatory elements are underlined, including a
Stimulatory Protein 1 (Sp-1) binding site, an Activator Protein 1 (AP-1) binding
site, a cAMP response element binding protein (CREB) site, a SRY-related
HMG-box gene (Sox5) binding site, a Sex-determining region Y gene product
(SRY) binding site, a c-Ets binding site, a GATA binding site, and an Octamer
transcription factor 1 (Oct-1) binding site.

Figure 14 presents a comparison of the putative cis-DNA regulatory
elements in the mouse and human EP17 promoter regions.

Figures 15A, 15B, and 15C present experiments demonstrating
hormonal regulation of mEP17 transcription.

Figure 15A shows Northern blot analysis of epididymal total RNA (10
µg/lane) extracted from intact (I) and castrated animals at 5, 10, 20, and 30
days following castration (C5, C10, C20 and C30 respectively), hybridized with
[32P]-labeled mEP17 cDNA.

Figure 15B shows Northern blot analysis to detect mEP17 RNA four
days following hemicastration. Levels of mEP17 RNA in the epididymis of the
castrated side (HI) are reduced to 0.7% of RNA levels in the epididymis of the
non-castrated side (HC).

Figure 15C shows Northern blot analysis to detect mEP17 RNA 5 days
after castration (C5) and 5 days after castration and androgen replacement
(TP).

Figure 16 depicts homologous recombination at the mEP17 locus using
the pLN-17 vector. The mEP17/mE-RABP genomic region is presented at the
top. mEP17 exons are indicated by hatched rectangles. mE-RABP exons are
indicated by open rectangles. The mEP17 targeting plasmid pLN-17 is
designed so that 1.4 kb of mEP17 5' flanking region is positioned immediately
upstream of the vector PGK neomycin sequence, and 10.9 kb of mEP17 3'
flanking region and mE-RABP gene is positioned immediately downstream of
the vector PGK neomycin sequence. mEP17 sequences carried in the pLN-17
targeting vector mediate homologous recombination, depicted as an "X"
between the genomic region and the targeting plasmid. The recombination
event creates a genomic reorganization wherein the entire mEP17 coding
sequence is replaced by the PGK neomycin sequence.

Figure 17 shows recombinant production of mEP17 protein using the
pBAD/gIII vector (Invitrogen). Protein extracted from E. coli transformed with
pBAD/gIII-mEP17 is resolved by polyacrylamide gel electrophoresis.

Figure 17A shows Coomassie blue staining that identifies two enriched
protein species (boxed).

Figure 17B shows Western blot analysis using an anti-his tag antibody
to detect two recombinant proteins of approximately 21 and 23 kDa,
corresponding to the processed and non-processed mEP17 isoforms.

Detailed Description of the Invention 
The present invention provides isolated nucleic acids comprising a
lipocalin gene promoter region (representative embodiments set forth as SEQ
ID NOs:1 and 5), isolated nucleic acids comprising a human lipocalin gene (a
representative embodiment set forth as SEQ ID NO:2), isolated nucleic acids
encoding a lipocalin polypeptide (a representative embodiment set forth as
SEQ ID NO:3), isolated lipocalin polypeptides (a representative embodiment
set forth as SEQ ID NO:4), and uses thereof. The disclosed lipocalin nucleic
acids and polypeptides can be used according to methods of the present
invention to generate a mouse model of male infertility, for drug discovery
screens, and for therapeutic treatment of fertility-related conditions, among
other uses.
I. DEFINITIONS

While the following terms are believed to be well understood by one of
ordinary skill in the art, the following definitions are set forth to facilitate
explanation of the invention. The entire contents of all publications mentioned
herein, including the discussion of the background art presented above, are
hereby fully incorporated by reference.

I.A. EP17 Nucleic Acids

The nucleic acid molecules provided by the present invention include the
isolated nucleic acid molecules of SEQ ID NOs:1, 2, 3, and 5, sequences
substantially similar to sequences of SEQ ID NOs:1, 2, 3, and 5, conservative
variants thereof, subsequences and elongated sequences thereof,
complementary DNA molecules, and corresponding RNA molecules. The
present invention also encompasses genes, cDNAs, chimeric genes, and
vectors comprising disclosed EP17 nucleic acid sequences.

The term "nucleic acid molecule" refers to deoxyribonucleotides or
ribonucleotides and polymers thereof in either single- or double-stranded form.
Unless specifically limited, the term encompasses nucleic acids containing
known analogues of natural nucleotides which have similar properties as the
reference natural nucleic acid. Unless otherwise indicated, a particular
nucleotide sequence also implicitly encompasses conservatively modified
variants thereof (e.g., degenerate codon substitutions), complementary
sequences, subsequences, elongated sequences, as well as the sequence
explicitly indicated. The terms "nucleic acid molecule" or "nucleotide sequence"
can also be used in place of "gene", "cDNA", or "mRNA". Nucleic acids can be
derived from any source.

The term "isolated", as used in the context of a nucleic acid molecule,
indicates that the nucleic acid molecule exists apart from its native environment
and is not a product of nature. An isolated DNA molecule can exist in a purified
form or can exist in a non-native environment such as a transgenic host cell.

The term "purified", when applied to a nucleic acid, denotes that the
nucleic acid is essentially free of other cellular components with which it is
associated in the natural state. Preferably, a purified nucleic acid molecule is
a homogeneous dry or aqueous solution. The term "purified" denotes that a
nucleic acid or protein gives rise to essentially one band in an electrophoretic
gel. Particularly, it means that the nucleic acid is at least about 50% pure, more
preferably at least about 85% pure, and most preferably at least about 99%
pure.

The term "substantially identical", in the context of two nucleotide or amino
acid sequences, can also be defined as two or more sequences or
subsequences that have at least 60%, preferably 80%, more preferably
90-95%, and most preferably at least 99% nucleotide or amino acid sequence
identity, when compared and aligned for maximum correspondence, as
measured using one of the following sequence comparison algorithms
(described herein below under the heading Nucleotide and Amino Acid
Sequence Comparisons) or by visual inspection. Preferably, the substantial
identity exists in nucleotide sequences of at least 50 residues, more preferably
in nucleotide sequences of at least about 100 residues, more preferably in
nucleotide sequences of at least about 150 residues, and most preferably in
nucleotide sequences comprising complete coding sequences. In one aspect,
polymorphic sequences can be substantially identical sequences. The term
"polymorphic" refers to the occurrence of two or more genetically determined
alternative sequences or alleles in a population. An allelic difference can be
as small as one base pair.
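
By way of non-limiting illustration, the percent identity thresholds recited above can be applied to a pair of sequences that have already been aligned. The following Python sketch is an assumption made for explanatory purposes only: the function names, the treatment of gap characters ("-"), and the handling of values between 95% and 99% are hypothetical and are not the comparison algorithms referenced under the heading Nucleotide and Amino Acid Sequence Comparisons.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity over aligned columns in which neither sequence has a gap.

    Assumes the two strings are already aligned (equal length, '-' for gaps).
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    pairs = [
        (a, b)
        for a, b in zip(aligned_a.upper(), aligned_b.upper())
        if a != "-" and b != "-"
    ]
    if not pairs:
        return 0.0
    matches = sum(1 for a, b in pairs if a == b)
    return 100.0 * matches / len(pairs)


def identity_tier(pct: float) -> str:
    """Map a percent identity onto the tiers recited in the text.

    Assumption: any value of 90% or more but below 99% is reported as the
    "more preferred" tier, although the text recites 90-95% for that tier.
    """
    if pct >= 99.0:
        return "most preferred"
    if pct >= 90.0:
        return "more preferred"
    if pct >= 80.0:
        return "preferred"
    if pct >= 60.0:
        return "substantially identical"
    return "below the 60% threshold"


if __name__ == "__main__":
    pct = percent_identity("ATGCATGCAT", "ATGCATGCAA")
    print(pct, identity_tier(pct))
```

The sketch reports identity only; it does not itself perform the alignment "for maximum correspondence" that the definition requires.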

Another indication that two nucleotide sequences are substantially
identical is that the two molecules specifically or substantially hybridize to each
other under stringent conditions. In the context of nucleic acid hybridization,
two nucleic acid sequences being compared can be designated a "probe" and
a "target". A "probe" is a reference nucleic acid molecule, and a "target" is a
test nucleic acid molecule, often found within a heterogeneous population of
nucleic acid molecules. A "target sequence" is synonymous with a "test
sequence".

A preferred nucleotide sequence employed for hybridization studies or
assays includes probe sequences that are complementary to or mimic at least
about a 14 to 40 nucleotide sequence of a nucleic acid molecule of the
present invention. Preferably, probes comprise 14 to 20 nucleotides, or even
longer where desired, such as 30, 40, 50, 60, 100, 200, 300, or 500
nucleotides or up to the full length of any of SEQ ID NOs:1, 2, 3, and 5. Such
fragments can be readily prepared by, for example, directly synthesizing the
fragment by chemical synthesis, by application of nucleic acid amplification
technology, or by introducing selected sequences into recombinant vectors for
recombinant production. The phrase "hybridizing specifically to" refers to the
binding, duplexing, or hybridizing of a molecule only to a particular nucleotide
sequence under stringent conditions when that sequence is present in a
complex nucleic acid mixture (e.g., total cellular DNA or RNA). The phrase
"binds substantially to" refers to complementary hybridization between a probe
nucleic acid molecule and a target nucleic acid molecule and embraces minor
mismatches that can be accommodated by reducing the stringency of the
hybridization media to achieve the desired hybridization.

"Stringent hybridization conditions" and "stringent hybridization wash
conditions" in the context of nucleic acid hybridization experiments such as
Southern and Northern blot analysis are both sequence- and environment-dependent.
Longer sequences hybridize specifically at higher temperatures.
An extensive guide to the hybridization of nucleic acids is found in Tijssen
(1993) "Laboratory Techniques in Biochemistry and Molecular Biology-
Hybridization with Nucleic Acid Probes", part I chapter 2, Elsevier, New York.
Generally, highly stringent hybridization and wash conditions are selected to
be about 5°C lower than the thermal melting point (Tm) for the specific
sequence at a defined ionic strength and pH. Typically, under "stringent
conditions" a probe will hybridize specifically to its target subsequence, but to
no other sequences.

The Tm is the temperature (under defined ionic strength and pH) at which
50% of the target sequence hybridizes to a perfectly matched probe. Very
stringent conditions are selected to be equal to the Tm for a particular probe.
An example of stringent hybridization conditions for Southern or Northern Blot
analysis of complementary nucleic acids having more than about 100
complementary residues is overnight hybridization in 50% formamide with 1 mg
of heparin at 42°C. An example of highly stringent wash conditions is 15
minutes in 0.15 M NaCl at 65°C. An example of stringent wash conditions is 15
minutes in 0.2X SSC buffer at 65°C (See Sambrook (1989) for a description
of SSC buffer). Often, a high stringency wash is preceded by a low stringency
wash to remove background probe signal. An example of medium stringency
wash conditions for a duplex of more than about 100 nucleotides is 15 minutes
in 1X SSC at 45°C. An example of low stringency wash for a duplex of more
than about 100 nucleotides is 15 minutes in 4-6X SSC at 40°C. For short
probes (e.g., about 10 to 50 nucleotides), stringent conditions typically involve
salt concentrations of less than about 1.0 M Na+ ion, typically about 0.01 to 1.0
M Na+ ion concentration (or other salts) at pH 7.0-8.3, and the temperature is
typically at least about 30°C. Stringent conditions can also be achieved with the
addition of destabilizing agents such as formamide. In general, a signal to
noise ratio that is 2-fold (or more) higher than that observed for an unrelated
probe in the particular hybridization assay indicates detection of a specific
hybridization.
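
The relationship between Tm and stringency described above can be sketched numerically. The present text does not recite a formula for Tm; the Python sketch below therefore assumes a commonly used empirical approximation for long DNA duplexes based on monovalent salt concentration, percent G+C, and duplex length. The formula, the function names, and the default salt concentration are assumptions introduced for illustration only, and real melting temperatures should be determined experimentally.

```python
import math


def estimate_tm_celsius(seq: str, na_molar: float = 0.15) -> float:
    """Rough Tm estimate for a DNA duplex (assumed empirical approximation).

    Tm = 81.5 + 16.6*log10([Na+]) + 0.41*(%GC) - 675/N
    This formula is NOT recited in the text; it is a widely used estimate.
    """
    seq = seq.upper()
    n = len(seq)
    if n == 0 or na_molar <= 0:
        raise ValueError("need a non-empty sequence and a positive salt concentration")
    pct_gc = 100.0 * (seq.count("G") + seq.count("C")) / n
    return 81.5 + 16.6 * math.log10(na_molar) + 0.41 * pct_gc - 675.0 / n


def highly_stringent_temperature(seq: str, na_molar: float = 0.15) -> float:
    """Temperature about 5 degrees C below the estimated Tm, per the text."""
    return estimate_tm_celsius(seq, na_molar) - 5.0


if __name__ == "__main__":
    probe = "GC" * 25 + "AT" * 25  # hypothetical 100-nt probe, 50% GC
    print(round(estimate_tm_celsius(probe, na_molar=1.0), 2))
    print(round(highly_stringent_temperature(probe, na_molar=1.0), 2))
```

Under these assumptions, "very stringent" conditions would correspond to the value returned by `estimate_tm_celsius` itself, and "highly stringent" conditions to a temperature about 5°C below it.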
The following are examples of hybridization and wash conditions that
can be used to clone homologous nucleotide sequences that are substantially
identical to reference nucleotide sequences of the present invention: a probe
nucleotide sequence preferably hybridizes to a target nucleotide sequence in
7% sodium dodecyl sulfate (SDS), 0.5 M NaPO4, 1 mM EDTA at 50°C followed
by washing in 2X SSC, 0.1% SDS at 50°C; more preferably, a probe and target
sequence hybridize in 7% sodium dodecyl sulfate (SDS), 0.5 M NaPO4, 1 mM
EDTA at 50°C followed by washing in 1X SSC, 0.1% SDS at 50°C; more
preferably, a probe and target sequence hybridize in 7% sodium dodecyl
sulfate (SDS), 0.5 M NaPO4, 1 mM EDTA at 50°C followed by washing in 0.5X
SSC, 0.1% SDS at 50°C; more preferably, a probe and target sequence
hybridize in 7% sodium dodecyl sulfate (SDS), 0.5 M NaPO4, 1 mM EDTA at
50°C followed by washing in 0.1X SSC, 0.1% SDS at 50°C; more preferably,
a probe and target sequence hybridize in 7% sodium dodecyl sulfate (SDS),
0.5 M NaPO4, 1 mM EDTA at 50°C followed by washing in 0.1X SSC, 0.1%
SDS at 65°C.
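
For convenience, the five exemplary hybridization and wash conditions recited above can be tabulated as data. The following Python sketch is illustrative only; the class name, field names, and the `more_stringent` helper are hypothetical, and the ordering rule (lower SSC concentration and/or higher temperature is treated as more stringent) is an assumption consistent with the progression in the text.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class WashCondition:
    ssc_x: float    # SSC concentration, in multiples of 1X
    sds_pct: float  # percent SDS in the wash
    temp_c: int     # wash temperature in degrees C


# Hybridization buffer shared by all five examples in the text.
HYBRIDIZATION = {"sds_pct": 7.0, "napo4_molar": 0.5, "edta_mm": 1.0, "temp_c": 50}

# Washes listed in the order recited: "preferably", then each "more preferably".
WASH_CONDITIONS = [
    WashCondition(2.0, 0.1, 50),
    WashCondition(1.0, 0.1, 50),
    WashCondition(0.5, 0.1, 50),
    WashCondition(0.1, 0.1, 50),
    WashCondition(0.1, 0.1, 65),
]


def more_stringent(a: WashCondition, b: WashCondition) -> bool:
    """True if wash `a` is stricter than `b` (assumed rule: less salt or more heat)."""
    no_worse = a.ssc_x <= b.ssc_x and a.temp_c >= b.temp_c
    strictly_better = a.ssc_x < b.ssc_x or a.temp_c > b.temp_c
    return no_worse and strictly_better


if __name__ == "__main__":
    for wash in WASH_CONDITIONS:
        print(f"{wash.ssc_x}X SSC, {wash.sds_pct}% SDS at {wash.temp_c} C")
```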

A further indication that two nucleic acid sequences are substantially
identical is that proteins encoded by the nucleic acids are substantially
identical, share an overall three-dimensional structure, are biologically
functional equivalents, or are immunologically cross-reactive. These terms are
defined further under the heading EP17 Polypeptides herein below. Nucleic
acid molecules that do not hybridize to each other under stringent conditions
are still substantially identical if the corresponding proteins are substantially
identical. This can occur, for example, when two nucleotide sequences are
significantly degenerate as permitted by the genetic code.

The term "conservatively substituted variants" refers to nucleic acid
sequences having degenerate codon substitutions wherein the third position
of one or more selected (or all) codons is substituted with mixed-base and/or
deoxyinosine residues (Batzer et al. (1991) Nucleic Acids Res. 19:5081;
Ohtsuka et al. (1985) J Biol Chem 260:2605-2608; Rossolini et al. (1994) Mol
Cell Probes 8:91-98).
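
The notion of a degenerate third codon position can be illustrated with a short sketch. The Python code below assumes the standard genetic code and represents a mixed-base position with a single-letter IUPAC ambiguity code; deoxyinosine substitution is not modeled. The function name `degenerate_third_positions` and the example sequence are hypothetical and are not taken from any disclosed sequence.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# IUPAC ambiguity codes keyed by the alphabetically sorted set of bases.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "AG": "R", "CT": "Y", "CG": "S", "AT": "W", "GT": "K", "AC": "M",
    "CGT": "B", "AGT": "D", "ACT": "H", "ACG": "V", "ACGT": "N",
}


def degenerate_third_positions(cds: str) -> str:
    """Replace each codon's third base with the mixed-base code covering every
    third base that keeps the same first two bases and the same amino acid."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    out = []
    for i in range(0, len(cds), 3):
        codon = cds[i:i + 3]
        amino_acid = CODON_TABLE[codon]
        thirds = "".join(
            sorted(b for b in BASES if CODON_TABLE[codon[:2] + b] == amino_acid)
        )
        out.append(codon[:2] + IUPAC[thirds])
    return "".join(out)


if __name__ == "__main__":
    # Hypothetical coding sequence: Met-Ala-Lys-Ile
    print(degenerate_third_positions("ATGGCTAAAATT"))
```

In this sketch a codon such as ATG, which has no synonymous alternative, is left unchanged, whereas a four-fold degenerate codon such as GCT is written with N at the third position.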

The term "subsequence" refers to a sequence of nucleic acids that
comprises a part of a longer nucleic acid sequence. An exemplary
subsequence is a probe, described herein above, or a primer. The term
"primer" as used herein refers to a contiguous sequence comprising about 8
or more deoxyribonucleotides or ribonucleotides, preferably 10-20 nucleotides,
and more preferably 20-30 nucleotides of a selected nucleic acid molecule.
The primers of the invention encompass oligonucleotides of sufficient length
and appropriate sequence so as to provide initiation of polymerization on a
nucleic acid molecule of the present invention.

The term "elongated sequence" refers to an addition of nucleotides (or
other analogous molecules) incorporated into the nucleic acid. For example,
a polymerase (e.g., a DNA polymerase) can add sequences at the 3' terminus
of the nucleic acid molecule. In addition, the
nucleotide sequence can be combined with other DNA sequences, such as
promoters, promoter regions, enhancers, polyadenylation signals, intronic
sequences, additional restriction enzyme sites, multiple cloning sites, and other
coding segments.

The term "complementary sequence", as used herein, indicates two
nucleotide sequences that comprise antiparallel nucleotide sequences capable
of pairing with one another upon formation of hydrogen bonds between base
pairs. As used herein, the term "complementary sequences" means nucleotide
sequences which are substantially complementary, as can be assessed by the
same nucleotide comparison set forth above, or are defined as being capable of
hybridizing to the nucleic acid segment in question under relatively stringent
conditions such as those described herein. A particular example of a
complementary nucleic acid segment is an antisense oligonucleotide.
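
As a simple illustration of antiparallel base pairing, the following Python sketch computes the reverse complement of a DNA sequence and tests whether two sequences are perfectly complementary. The function names are hypothetical, and the sketch checks only perfect complementarity; "substantially complementary" sequences as defined above can tolerate mismatches that this check would reject.

```python
# Watson-Crick complement table; U is mapped to A so RNA input is accepted.
_COMPLEMENT = str.maketrans("ACGTUacgtu", "TGCAAtgcaa")


def reverse_complement(seq: str) -> str:
    """Return the antiparallel complementary DNA strand, written 5' to 3'."""
    return seq.translate(_COMPLEMENT)[::-1]


def are_perfect_complements(a: str, b: str) -> bool:
    """True if strand b pairs with strand a at every position."""
    return reverse_complement(a).upper() == b.upper()


if __name__ == "__main__":
    print(reverse_complement("ATGC"))
    print(are_perfect_complements("AAGCTT", "AAGCTT"))  # palindromic site
```

An antisense oligonucleotide directed against a given sense-strand segment would, under this sketch, correspond to the value returned by `reverse_complement` for that segment.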

The term "gene" refers broadly to any segment of DNA associated with
a biological function. A gene encompasses sequences including but not limited
to a coding sequence, a promoter region, a cis-regulatory sequence, a
non-expressed DNA segment that is a specific recognition sequence for regulatory
proteins, a non-expressed DNA segment that contributes to gene expression,
a DNA segment designed to have desired parameters, or combinations
thereof. A gene can be obtained by a variety of methods, including cloning
from a biological sample, synthesis based on known or predicted sequence
information, and recombinant derivation of an existing sequence.

The term "promoter region" defines a nucleotide sequence within a gene
that is positioned 5' to a coding sequence of the same gene and functions to
direct transcription of the coding sequence. The promoter region includes a
transcriptional start site and at least one cis-regulatory element. The present
invention encompasses nucleic acid sequences that comprise a promoter
region of an EP17 gene, or functional portion thereof.

The terms "cis-acting regulatory sequence" or "cis-regulatory motif" or
"response element", as used herein, each refer to a nucleotide sequence that
enables responsiveness to a regulatory transcription factor. Responsiveness
can encompass a decrease or an increase in transcriptional output and is
mediated by binding of the transcription factor to the DNA molecule comprising
the response element.

The term "transcription factor" generally refers to a protein that
modulates gene expression by interaction with the cis-regulatory element and
cellular components for transcription, including RNA Polymerase, Transcription
Associated Factors (TAFs), chromatin-remodeling proteins, and any other
relevant protein that impacts gene transcription.

The term "gene expression" generally refers to the cellular processes by
which a biologically active polypeptide is produced from a DNA sequence.

A "functional portion" of a promoter gene fragment is a nucleotide
sequence within a promoter region that is required for normal gene
transcription. To determine nucleotide sequences that are functional, the
expression of a reporter gene is assayed when variably placed under the
direction of a promoter region fragment.

Promoter region fragments can be conveniently made by enzymatic
digestion of a larger fragment using restriction endonucleases or DNAse I.
Preferably, a functional promoter region fragment comprises about 5000
nucleotides, more preferably 2000 nucleotides, more preferably about 1000
nucleotides, more preferably a functional promoter region fragment comprises
about 500 nucleotides, even more preferably a functional promoter region
fragment comprises about 100 nucleotides, and even more preferably a
functional promoter region fragment comprises about 20 nucleotides.
The terms "reporter gene" or "marker gene"
refer to a heterologous gene encoding a product that is readily observed and/or
quantitated. A reporter gene is heterologous in that it originates from a source
foreign to an intended host cell or, if from the same source, is modified from its
original form. Non-limiting examples of detectable reporter genes that can be
operably linked to a transcriptional regulatory region can be found in Alam and
Cook (1990) Anal Biochem 188:245-254 and PCT International Publication No.
WO 97/47763. Preferred reporter genes for transcriptional analyses include
the lacZ gene (See, e.g., Rose and Botstein (1983) Meth Enzymol 101:167-
180), Green Fluorescent Protein (GFP) (Cubitt et al. (1995) Trends Biochem
Sci 20:448-455), luciferase, or chloramphenicol acetyltransferase (CAT).
Preferred reporter genes for methods to produce transgenic animals include
but are not limited to antibiotic resistance genes, and more preferably the
antibiotic resistance gene confers neomycin resistance. Any suitable reporter
and detection method can be used, and it will be appreciated by one of skill in
the art that no particular choice is essential to or a limitation of the present
invention.

An amount of reporter gene can be assayed by any method for
qualitatively or, preferably, quantitatively determining presence or activity of the
reporter gene product. The amount of reporter gene expression directed by
each test promoter region fragment is compared to an amount of reporter gene
expression directed by a control construct comprising the reporter gene in the
absence of a promoter region fragment. A promoter region fragment is identified as
having promoter activity when there is a significant increase in an amount of
reporter gene expression in a test construct as compared to a control construct.
The term "significant increase", as used herein, refers to a quantified change
in a measurable quality that is larger than the margin of error inherent in the
measurement technique, preferably an increase by about 2-fold or greater
relative to a control measurement, more preferably an increase by about 5-fold
or greater, and most preferably an increase by about 10-fold or greater.
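
The comparison of a test construct to a promoterless control construct can be expressed as a fold increase. The Python sketch below is a minimal illustration under stated assumptions: the function names and the example reporter readings are hypothetical, and the sketch does not model the margin of error of any particular measurement technique, which the definition above also requires to be exceeded.

```python
def fold_increase(test_signal: float, control_signal: float) -> float:
    """Reporter signal of the test construct relative to the control construct."""
    if control_signal <= 0:
        raise ValueError("control signal must be positive")
    return test_signal / control_signal


def promoter_activity_tier(test_signal: float, control_signal: float) -> str:
    """Classify a fold increase against the about 2-, 5- and 10-fold tiers."""
    fold = fold_increase(test_signal, control_signal)
    if fold >= 10.0:
        return "most preferred (about 10-fold or greater)"
    if fold >= 5.0:
        return "more preferred (about 5-fold or greater)"
    if fold >= 2.0:
        return "preferred (about 2-fold or greater)"
    return "below 2-fold"


if __name__ == "__main__":
    # Hypothetical reporter readings in arbitrary units.
    print(promoter_activity_tier(test_signal=50.0, control_signal=10.0))
```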
The present invention also encompasses chimeric genes comprising the
disclosed EP17 sequences. The term "chimeric gene", as used herein, refers
to an EP17 promoter region operably linked to an open reading frame, wherein
the nucleotide sequence created is not naturally occurring. In this case, the
open reading frame is also described as a "heterologous sequence". The term
"chimeric gene" also encompasses a promoter region operably linked to an
EP17 coding sequence, a nucleotide sequence producing an antisense RNA
molecule, an RNA molecule having tertiary structure, such as a hairpin structure,
or a double-stranded RNA molecule.

The term "operably linked", as used herein, refers to a promoter region
that is connected to a nucleotide sequence in such a way that the transcription
of that nucleotide sequence is controlled and regulated by that promoter region.
Techniques for operatively linking a promoter region to a nucleotide sequence
are well known in the art.

The terms "heterologous gene", "heterologous DNA sequence",
"heterologous nucleotide sequence", "exogenous nucleic acid molecule", or
"exogenous DNA segment", as used herein, each refer to a sequence that
originates from a source foreign to an intended host cell or, if from the same
source, is modified from its original form. Thus, a heterologous gene in a host
cell includes a gene that is endogenous to the particular host cell but has been
modified, for example by mutagenesis or by isolation from native cis-regulatory
sequences. The terms also include non-naturally occurring multiple copies of
a naturally occurring nucleotide sequence. Thus, the terms refer to a DNA
segment that is foreign or heterologous to the cell, or homologous to the cell
but in a position within the host cell nucleic acid wherein the element is not
ordinarily found.

The present invention further includes vectors comprising the disclosed
EP17 sequences, including plasmids, cosmids, and viral vectors. The term
"vector", as used herein, refers to a DNA molecule having sequences that
enable its replication in a compatible host cell. A vector also includes
nucleotide sequences to permit ligation of nucleotide sequences within the
vector, wherein such nucleotide sequences are also replicated in a compatible
host cell. A vector can also mediate recombinant production of an EP17
polypeptide, as described further herein below. Preferred vectors include but
are not limited to pBluescript (Stratagene), pUC18, pBLCAT (Luckow and
Schutz (1987) Nucleic Acids Res 15:5490), pLNTK (Gorman et al. (1996)
Immunity 5:241-252), and pBAD/gIII (Invitrogen). A preferred host cell is a
mammalian cell; more preferably the cell is a Chinese hamster ovary cell, a
HeLa cell, a baby hamster kidney cell, or a mouse cell; more preferably the cell
is a mouse epididymal cell; even more preferably the cell is a human cell.

Nucleic acids of the present invention can be cloned, synthesized,
recombinantly altered, mutagenized, or combinations thereof. Standard
recombinant DNA and molecular cloning techniques used to isolate nucleic
acids are well known in the art. Exemplary, non-limiting methods are described
by Sambrook et al., eds. (1989) "Molecular Cloning", Cold Spring Harbor
Laboratory Press, Cold Spring Harbor, New York; by Silhavy et al. (1984)
"Experiments with Gene Fusions", Cold Spring Harbor Laboratory Press, Cold
Spring Harbor, New York; by Ausubel et al. (1992) Current Protocols in
Molecular Biology, John Wiley and Sons, Inc., New York; and by Glover, ed.
(1985) "DNA Cloning: A Practical Approach", IRL Press, Ltd., Oxford, U.K.
Site-specific mutagenesis methods to create base pair changes, deletions, or small
insertions are also well known in the art as exemplified by publications, see,
e.g., Adelman et al. (1983) DNA 2:183; Sambrook et al. (1989).

Sequences detected by methods of the invention can be detected,
subcloned, sequenced, and further evaluated by any measure well known in
the art using any method usually applied to the detection of a specific DNA
sequence including but not limited to dideoxy sequencing, PCR, oligomer
restriction (Saiki et al. (1985) Bio/Technology 3:1008-1012), allele-specific
oligonucleotide (ASO) probe analysis (Conner et al. (1983) Proc. Natl. Acad.
Sci. U.S.A. 80:278), and oligonucleotide ligation assays (OLAs) (Landegren et
al. (1988) Science 241:1007). Molecular techniques for DNA analysis have
been reviewed (Landegren et al. (1988) Science 242:229-237).
I.B. EP17 Polypeptides

The polypeptides provided by the present invention include the isolated
polypeptide of SEQ ID NO:4, polypeptides substantially similar to sequences
of SEQ ID NO:4, EP17 polypeptide fragments, fusion proteins comprising EP17
amino acid sequences, biologically functional analogs, and polypeptides that
cross-react with an antibody that specifically recognizes an EP17 polypeptide.

The term "isolated", as used in the context of a polypeptide, indicates
that the polypeptide exists apart from its native environment and is not a
product of nature. An isolated polypeptide can exist in a purified form or can
exist in a non-native environment such as, for example, in a transgenic host
cell.

The term "purified", when applied to a polypeptide, denotes that the
polypeptide is essentially free of other cellular components with which it is
associated in the natural state. Preferably, a polypeptide is a homogeneous
solid or aqueous solution. Purity and homogeneity are typically determined
using analytical chemistry techniques such as polyacrylamide gel
electrophoresis or high performance liquid chromatography. A polypeptide
which is the predominant species present in a preparation is substantially
purified. The term "purified" denotes that a polypeptide gives rise to essentially
one band in an electrophoretic gel. Particularly, it means that the polypeptide
is at least about 50% pure, more preferably at least about 85% pure, and most
preferably at least about 99% pure.

The term "substantially identical" in the context of two or more
polypeptide sequences refers to polypeptide sequences having
about 35%, or 45%, or preferably from 45-55%, or more preferably 55-65%, or
most preferably 65% or greater amino acids which are identical or functionally
equivalent. Percent "identity" and methods for determining identity are defined
herein below under the heading Nucleotide and Amino Acid Sequence
Comparisons.

Substantially identical polypeptides also encompass two or more
polypeptides sharing a conserved three-dimensional structure. Computational
methods can be used to compare structural representations, and structural
superpositions can be generated and easily tuned to identify similarities around
important active sites or ligand binding sites. See Henikoff et al. (2000)
Electrophoresis 21(9):1700-1706; Huang et al. (2000) Pac Symp Biocomput
230-241; Saqi et al. (1999) Bioinformatics 15(6):521-522; and Barton (1998)
Acta Crystallogr D Biol Crystallogr 54:1139-1146.

The term "functionally equivalent" in the context of amino acid
sequences is well known in the art and is based on the relative similarity of the
amino acid side-chain substituents. See Henikoff and Henikoff (2000) Adv
Protein Chem 54:73-97. Relevant factors for consideration include side-chain
hydrophobicity, hydrophilicity, charge, and size. For example, arginine, lysine,
and histidine are all positively charged residues; alanine, glycine, and
serine are all of similar size; and phenylalanine, tryptophan, and tyrosine
all have a generally similar shape. By this analysis, described further herein
below, arginine, lysine, and histidine; alanine, glycine, and serine; and
phenylalanine, tryptophan, and tyrosine; are defined herein as biologically
functional equivalents.
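
The three groups of biologically functional equivalents named above can be used to score two aligned polypeptide sequences for residues that are "identical or functionally equivalent". The Python sketch below is illustrative only; the function names, the gap convention ("-"), and the example peptides are hypothetical, and the grouping is limited to the three groups expressly recited in this paragraph.

```python
# One-letter codes for the three groups recited in the text:
# Arg/Lys/His, Ala/Gly/Ser, and Phe/Trp/Tyr.
EQUIVALENCE_GROUPS = [set("RKH"), set("AGS"), set("FWY")]


def functionally_equivalent(a: str, b: str) -> bool:
    """True if two residues are identical or fall within the same group."""
    a, b = a.upper(), b.upper()
    return a == b or any(a in g and b in g for g in EQUIVALENCE_GROUPS)


def percent_identical_or_equivalent(aligned_a: str, aligned_b: str) -> float:
    """Percent of ungapped aligned positions that are identical or equivalent."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    pairs = [(a, b) for a, b in zip(aligned_a, aligned_b) if a != "-" and b != "-"]
    if not pairs:
        return 0.0
    hits = sum(1 for a, b in pairs if functionally_equivalent(a, b))
    return 100.0 * hits / len(pairs)


if __name__ == "__main__":
    # Hypothetical four-residue peptides: M=M, K~R, A~G, F vs L (no match).
    print(percent_identical_or_equivalent("MKAF", "MRGL"))
```

A value computed in this way could then be compared with the 35%, 45%, 55%, and 65% levels recited in the definition of substantially identical polypeptides.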

In making biologically functional equivalent amino acid substitutions, the
hydropathic index of amino acids can be considered. Each amino acid has
been assigned a hydropathic index on the basis of its hydrophobicity and
charge characteristics; these are: isoleucine (+4.5); valine (+4.2); leucine
(+3.8); phenylalanine (+2.8); cysteine (+2.5); methionine (+1.9); alanine (+1.8);
glycine (-0.4); threonine (-0.7); serine (-0.8); tryptophan (-0.9); tyrosine (-1.3);
proline (-1.6); histidine (-3.2); glutamate (-3.5); glutamine (-3.5); aspartate
(-3.5); asparagine (-3.5); lysine (-3.9); and arginine (-4.5).

The importance of the hydropathic amino acid index in conferring
interactive biological function on a protein is generally understood in the art
(Kyte et al. (1982) J Mol Biol 157:105). It is known that certain amino acids
can be substituted for other amino acids having a similar hydropathic index or
score and still retain a similar biological activity. In making changes based upon
the hydropathic index, the substitution of amino acids whose hydropathic
indices are within ±2 of the original value is preferred, those which are within
±1 of the original value are particularly preferred, and those within ±0.5 of the
original value are even more particularly preferred.
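
The hydropathic index values and the ±2, ±1, and ±0.5 windows recited above lend themselves to a direct lookup. The following Python sketch is a non-limiting illustration; the index values are those listed in the text, while the function name and the rounding used to avoid floating-point boundary effects are assumptions.

```python
# Hydropathic index values as recited in the text (one-letter codes).
HYDROPATHY = {
    "I": 4.5, "V": 4.2, "L": 3.8, "F": 2.8, "C": 2.5, "M": 1.9, "A": 1.8,
    "G": -0.4, "T": -0.7, "S": -0.8, "W": -0.9, "Y": -1.3, "P": -1.6,
    "H": -3.2, "E": -3.5, "Q": -3.5, "D": -3.5, "N": -3.5, "K": -3.9, "R": -4.5,
}


def hydropathy_substitution_tier(original: str, replacement: str) -> str:
    """Classify a substitution by the difference in hydropathic index."""
    delta = round(abs(HYDROPATHY[original.upper()] - HYDROPATHY[replacement.upper()]), 6)
    if delta <= 0.5:
        return "even more particularly preferred"
    if delta <= 1.0:
        return "particularly preferred"
    if delta <= 2.0:
        return "preferred"
    return "outside the preferred range"


if __name__ == "__main__":
    for pair in [("I", "V"), ("A", "C"), ("G", "P"), ("I", "R")]:
        print(pair, hydropathy_substitution_tier(*pair))
```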

It is also understood in the art that the substitution of like amino acids
can be made effectively on the basis of hydrophilicity. U.S. Pat. No. 4,554,101
states that the greatest local average hydrophilicity of a protein, as governed
by the hydrophilicity of its adjacent amino acids, correlates with its
immunogenicity and antigenicity, i.e., with a biological property of the protein.
It is understood that an amino acid can be substituted for another having a
similar hydrophilicity value and still obtain a biologically equivalent protein.

As detailed in U.S. Pat. No. 4,554,101, the following hydrophilicity
values have been assigned to amino acid residues: arginine (+3.0); lysine
(+3.0); aspartate (+3.0±1); glutamate (+3.0±1); serine (+0.3); asparagine
(+0.2); glutamine (+0.2); glycine (0); threonine (-0.4); proline (-0.5±1); alanine
(-0.5); histidine (-0.5); cysteine (-1.0); methionine (-1.3); valine (-1.5); leucine
(-1.8); isoleucine (-1.8); tyrosine (-2.3); phenylalanine (-2.5); tryptophan (-3.4).

In making changes based upon similar hydrophilicity values, the
substitution of amino acids whose hydrophilicity values are within ±2 of the
original value is preferred, those which are within ±1 of the original value are
particularly preferred, and those within ±0.5 of the original value are even more
particularly preferred.
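
The ±2, ±1, and ±0.5 preference tiers described above for the hydropathic index and for hydrophilicity values can be illustrated with a minimal sketch. The two tables reproduce the values listed in the text; the function name `substitution_tier`, the three-letter residue keys, and the choice to ignore the "±1" tolerance attached to aspartate, glutamate, and proline are assumptions made only for illustration.

```python
# Hydropathic index values as listed in the text (Kyte et al., 1982).
HYDROPATHY = {
    "Ile": 4.5, "Val": 4.2, "Leu": 3.8, "Phe": 2.8, "Cys": 2.5,
    "Met": 1.9, "Ala": 1.8, "Gly": -0.4, "Thr": -0.7, "Ser": -0.8,
    "Trp": -0.9, "Tyr": -1.3, "Pro": -1.6, "His": -3.2, "Glu": -3.5,
    "Gln": -3.5, "Asp": -3.5, "Asn": -3.5, "Lys": -3.9, "Arg": -4.5,
}

# Hydrophilicity values as listed in the text (U.S. Pat. No. 4,554,101).
# Assumption: only the nominal value is used; the "±1" tolerance on
# Asp, Glu, and Pro is ignored in this sketch.
HYDROPHILICITY = {
    "Arg": 3.0, "Lys": 3.0, "Asp": 3.0, "Glu": 3.0, "Ser": 0.3,
    "Asn": 0.2, "Gln": 0.2, "Gly": 0.0, "Thr": -0.4, "Pro": -0.5,
    "Ala": -0.5, "His": -0.5, "Cys": -1.0, "Met": -1.3, "Val": -1.5,
    "Leu": -1.8, "Ile": -1.8, "Tyr": -2.3, "Phe": -2.5, "Trp": -3.4,
}


def substitution_tier(original, replacement, scale):
    """Classify a substitution by the difference in scale values.

    `scale` is one of the two tables above. The returned label follows
    the tiers named in the text; "not preferred" is a hypothetical label
    for differences greater than 2.
    """
    diff = abs(scale[original] - scale[replacement])
    eps = 1e-9  # tolerate floating-point rounding at the tier boundaries
    if diff <= 0.5 + eps:
        return "even more particularly preferred"
    if diff <= 1.0 + eps:
        return "particularly preferred"
    if diff <= 2.0 + eps:
        return "preferred"
    return "not preferred"


if __name__ == "__main__":
    print(substitution_tier("Ile", "Val", HYDROPATHY))
    print(substitution_tier("Ser", "Thr", HYDROPHILICITY))
```

The same function serves both scales because the text states identical thresholds for each; a substitution may fall in different tiers depending on which scale is consulted.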

The present invention also encompasses EP17 polypeptide fragments
or functional portions of an EP17 polypeptide. Such a functional portion need
not comprise all or substantially all of the amino acid sequence of a native
lipocalin gene product. The term "functional" includes any biological activity or
feature of EP17, including immunogenicity.

The present invention also includes longer sequences of an EP17
polypeptide, or portion thereof. For example, one or more amino acids can be
added to the N-terminus or C-terminus of an EP17 polypeptide. Fusion proteins
comprising EP17 polypeptide sequences are also provided within the scope of
the present invention. Methods of preparing such proteins are known in the art.

The present invention also encompasses functional analogs of an EP17
polypeptide. Functional analogs share at least one biological function with an
EP17 polypeptide. An exemplary function is immunogenicity. In the context
of amino acid sequence, biologically functional analogs, as used herein, are
peptides in which certain, but not most or all, of the amino acids can be
substituted. Functional analogs can be created at the level of the
corresponding nucleic acid molecule, altering such sequence to encode
desired amino acid changes. In one embodiment, changes can be introduced
to improve the antigenicity of the protein. In another embodiment, an EP17
polypeptide sequence is varied so as to assess the activity of a mutant EP17
polypeptide.

The present invention also encompasses recombinant production of the
disclosed EP17 polypeptides. Briefly, a nucleic acid sequence encoding an
EP17 polypeptide, or portion thereof, is cloned into an expression cassette, the
cassette is introduced into a host organism, and the polypeptide is
recombinantly produced in that host.

The term "expression cassette" as used herein means a DNA sequence
capable of directing expression of a particular nucleotide sequence in an
appropriate host cell, comprising a promoter operably linked to the nucleotide
sequence of interest which is operably linked to termination signals. It also
typically comprises sequences required for proper translation of the nucleotide
sequence. The expression cassette comprising the nucleotide sequence of
interest can be chimeric. The expression cassette can also be one which is
naturally occurring but has been obtained in a recombinant form useful for
heterologous expression.

The expression of the nucleotide sequence in the expression cassette
can be under the control of a constitutive promoter or an inducible promoter
which initiates transcription only when the host cell is exposed to some
particular external stimulus. Exemplary promoters include Simian virus 40
early promoter, a long terminal repeat promoter from retrovirus, an actin
promoter, a heat shock promoter, and a metallothionein promoter. In the case
of a multicellular organism, the promoter and promoter region can direct
expression to a particular tissue or organ or stage of development. Exemplary
tissue-specific promoter regions include a mE-RABP promoter and an EP17
promoter, described herein above. Suitable expression vectors which can be
used include, but are not limited to, the following vectors or their derivatives:
human or animal viruses such as vaccinia virus or adenovirus, yeast vectors,
bacteriophage vectors (e.g., lambda phage), and plasmid and cosmid DNA
vectors.

The term "host cell", as used herein, refers to a cell into which a
heterologous nucleic acid molecule has been introduced. Transformed cells,
tissues, or organisms are understood to encompass not only the end product
of a transformation process, but also transgenic progeny thereof.






A host cell strain can be chosen which modulates the expression of the
inserted sequences, or modifies and processes the gene product in the specific
fashion desired. For example, different host cells have characteristic and
specific mechanisms for the translational and post-translational processing and
modification (e.g., glycosylation, phosphorylation) of proteins. Appropriate cell
lines or host systems can be chosen to ensure the desired modification and
processing of the foreign protein expressed. Expression in a bacterial system
can be used to produce a non-glycosylated core protein product. Expression
in yeast will produce a glycosylated product. Expression in animal cells can be
used to ensure "native" glycosylation of a heterologous protein.

Expression constructs are transfected into a host cell by any standard
method, including electroporation, calcium phosphate precipitation, DEAE-
Dextran transfection, liposome-mediated transfection, and infection using a
retrovirus. The EP17-encoding nucleotide sequence carried in the expression
construct can be stably integrated into the genome of the host or it can be
present as an extrachromosomal molecule.

Isolated polypeptides and recombinantly produced polypeptides can be
purified and characterized using a variety of standard techniques that are well
known to the skilled artisan. See, e.g., chapter 16 of Ausubel et al. (1992),
Bodanszky, et al. (1976) "Peptide Synthesis", John Wiley and Sons, Second
Edition, New York, and Zimmer et al. (1993) "Peptides" pp. 393-394, ESCOM
Science Publishers, B.V.

I.C. Nucleotide and Amino Acid Sequence Comparisons
The terms "identical" or percent "identity" in the context of two or more
nucleotide or polypeptide sequences, refer to two or more sequences or
subsequences that are the same or have a specified percentage of amino acid
residues or nucleotides that are the same, when compared and aligned for
maximum correspondence, as measured using one of the sequence
comparison algorithms disclosed herein or by visual inspection.

The term "substantially identical" in regards to a nucleotide or
polypeptide sequence means that a particular sequence varies from the
sequence of a naturally occurring sequence by one or more deletions,
substitutions, or additions, the net effect of which is to retain at least some of
the biological activity of the natural gene, gene product, or sequence. Such
sequences include "mutant" sequences, or sequences in which the biological
activity is altered to some degree but retains at least some of the original
biological activity. The term "naturally occurring", as used herein, is used to
describe a composition that can be found in nature as distinct from being
artificially produced by man. For example, a protein or nucleotide sequence
present in an organism, which can be isolated from a source in nature and
which has not been intentionally modified by man in the laboratory, is naturally
occurring.

For sequence comparison, typically one sequence acts as a reference
sequence to which test sequences are compared. When using a sequence
comparison algorithm, test and reference sequences are entered into a
computer program, subsequence coordinates are designated if necessary, and
sequence algorithm program parameters are selected. The sequence
comparison algorithm then calculates the percent sequence identity for the
designated test sequence(s) relative to the reference sequence, based on the
selected program parameters.
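
The final step of the comparison described above, calculating percent identity once a test sequence has been aligned to the reference, can be sketched as follows. This is a minimal illustration and not any particular program's method: the function name `percent_identity`, the use of "-" as the gap character, and the choice of alignment length as the denominator are assumptions (different programs use different denominators).

```python
def percent_identity(aligned_ref, aligned_test):
    """Percent identity between two pre-aligned sequences.

    Both strings must have equal length, with '-' marking gaps.
    Assumption: identity is counted over alignment columns, skipping
    columns where both sequences carry a gap.
    """
    if len(aligned_ref) != len(aligned_test):
        raise ValueError("aligned sequences must have equal length")
    columns = [
        (a, b) for a, b in zip(aligned_ref, aligned_test)
        if not (a == "-" and b == "-")
    ]
    if not columns:
        return 0.0
    # A column is a match only if the residues agree and neither is a gap.
    matches = sum(1 for a, b in columns if a == b and a != "-")
    return 100.0 * matches / len(columns)


if __name__ == "__main__":
    # Hypothetical aligned nucleotide fragments, for illustration only.
    print(round(percent_identity("ACGT-A", "ACCTTA"), 2))
```

Designating subsequence coordinates, as mentioned in the text, corresponds here to slicing both aligned strings before calling the function.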

Optimal alignment of sequences for comparison can be conducted, e.g.,
by the local homology algorithm of Smith and Waterman (1981) Adv Appl Math
2:482, by the homology alignment algorithm of Needleman and Wunsch (1970)
J Mol Biol 48:443, by the search for similarity method of Pearson and Lipman
(1988) Proc Natl Acad Sci USA 85:2444, by computerized implementations of
these algorithms (GAP, BESTFIT, FASTA, and TFASTA in the Wisconsin
Genetics Software Package, Genetics Computer Group, Madison, WI), or by
visual inspection (see generally, Ausubel et al. (1992)).

A preferred algorithm for determining percent sequence identity and
sequence similarity is the BLAST algorithm, which is described in Altschul et
al. (1990) J Mol Biol 215:403-410. Software for performing BLAST analyses
is publicly available through the National Center for Biotechnology Information
(http://www.ncbi.nlm.nih.gov/). This algorithm involves first identifying high
scoring sequence pairs (HSPs) by identifying short words of length W in the
query sequence, which either match or satisfy some positive-valued threshold
score T when aligned with a word of the same length in a database sequence.
T is referred to as the neighborhood word score threshold. These initial
neighborhood word hits act as seeds for initiating searches to find longer HSPs
containing them. The word hits are then extended in both directions along each
sequence for as far as the cumulative alignment score can be increased.
Cumulative scores are calculated using, for nucleotide sequences, the
parameters M (reward score for a pair of matching residues; always > 0) and
N (penalty score for mismatching residues; always < 0). For amino acid
sequences, a scoring matrix is used to calculate the cumulative score.
Extension of the word hits in each direction is halted when the cumulative
alignment score falls off by the quantity X from its maximum achieved value,
the cumulative score goes to zero or below due to the accumulation of one or
more negative-scoring residue alignments, or the end of either sequence is
reached. The BLAST algorithm parameters W, T, and X determine the
sensitivity and speed of the alignment. The BLASTN program (for nucleotide
sequences) uses as defaults a wordlength W=11, an expectation E=10, a cutoff
of 100, M=5, N=-4, and a comparison of both strands. For amino acid
sequences, the BLASTP program uses as defaults a wordlength (W) of 3, an
expectation (E) of 10, and the BLOSUM62 scoring matrix. See Henikoff and
Henikoff (1992) Proc Natl Acad Sci USA 89:10915.

In addition to calculating percent sequence identity, the BLAST
algorithm also performs a statistical analysis of the similarity between two
sequences. See, e.g., Karlin and Altschul (1993) Proc Natl Acad Sci USA
90:5873-5877. One measure of similarity provided by the BLAST algorithm is
the smallest sum probability (P(N)), which provides an indication of the
probability by which a match between two nucleotide or amino acid sequences
would occur by chance. For example, a test nucleic acid sequence is
considered similar to a reference sequence if the smallest sum probability in a
comparison of the test nucleic acid sequence to the reference nucleic acid
sequence is less than about 0.1, more preferably less than about 0.01, and
most preferably less than about 0.001.




I.D. Antibodies

The present invention also provides an antibody that specifically binds
an EP17 polypeptide. The term "antibody" indicates an immunoglobulin
protein, or functional portion thereof, including a polyclonal antibody, a
monoclonal antibody, a chimeric antibody, a single chain antibody, Fab
fragments, and an Fab expression library. "Functional portion" refers to the
part of the protein that binds a molecule of interest. In a preferred embodiment,
an antibody of the invention is a monoclonal antibody. Techniques for
preparing and characterizing antibodies are well known in the art (see, e.g.,
Harlow and Lane (1988) Antibodies: A Laboratory Manual, Cold Spring Harbor
Laboratory Press, Cold Spring Harbor, New York). A monoclonal antibody of
the present invention can be readily prepared through use of well-known
techniques such as the hybridoma techniques exemplified in U.S. Patent No.
4,196,265 and the phage-displayed techniques disclosed in U.S. Patent No.
5,260,203.

The phrase "specifically (or selectively) binds to an antibody", or
"specifically (or selectively) immunoreactive with", when referring to a protein
or peptide, refers to a binding reaction which is determinative of the presence
of the protein in a heterogeneous population of proteins and other biological
materials. Thus, under designated immunoassay conditions, the specified
antibodies bind to a particular protein and do not show significant binding to
other proteins present in the sample. Specific binding to an antibody under
such conditions can require an antibody that is selected for its specificity for a
particular protein. For example, antibodies raised to a protein with an amino
acid sequence encoded by any of the nucleic acid sequences of the invention
can be selected to obtain antibodies specifically immunoreactive with that
protein and not with unrelated proteins.

The use of a molecular cloning approach to generate antibodies,
particularly monoclonal antibodies, and more particularly single chain
monoclonal antibodies, is also provided. The production of single chain
antibodies has been described in the art. See, e.g., U.S. Patent No. 5,260,203.
For this approach, combinatorial immunoglobulin phagemid libraries are
prepared from RNA isolated from the spleen of the immunized animal, and
phagemids expressing appropriate antibodies are selected by panning on
endothelial tissue. The advantages of this approach over conventional
hybridoma techniques are that approximately 10^4 times as many antibodies can
be produced and screened in a single round, and that new specificities are
generated by heavy (H) and light (L) chain combinations in a single chain,
which further increases the chance of finding appropriate antibodies. Thus, an
antibody of the present invention, or a "derivative" of an antibody of the present
invention, pertains to a single polypeptide chain binding molecule which has
binding specificity and affinity substantially similar to the binding specificity and
affinity of the light and heavy chain aggregate variable region of an antibody
described herein.

The term "immunochemical reaction", as used herein, refers to any of
a variety of immunoassay formats used to detect antibodies specifically bound
to a particular protein, including but not limited to, competitive and
non-competitive assay systems using techniques such as radioimmunoassays,
ELISA (enzyme linked immunosorbent assay), "sandwich" immunoassays,
immunoradiometric assays, gel diffusion precipitin reactions, immunodiffusion
assays, in situ immunoassays (e.g., using colloidal gold, enzyme or
radioisotope labels), western blots, precipitation reactions, agglutination assays
(e.g., gel agglutination assays, hemagglutination assays), complement fixation
assays, immunofluorescence assays, protein A assays, and
immunoelectrophoresis assays, etc. See Harlow and Lane (1988) for a
description of immunoassay formats and conditions.

I.E. Protein Binding Assays

The term "binding" refers to an affinity between two molecules, for
example, a ligand and a receptor, and means a preferential binding of one
molecule for another in a mixture of molecules. The binding of the molecules
can be considered specific if the binding affinity is about 1 x 10^4 M^-1 to about
1 x 10^6 M^-1 or greater. Binding of two molecules also encompasses a quality
or state of mutual action such that an activity of one protein or compound on
another protein is inhibitory (in the case of an antagonist) or enhancing (in the
case of an agonist).

Fluorescence Correlation Spectroscopy (FCS) theory was developed in
1972 but it is only in recent years that the technology to perform FCS became
available (Magde et al. (1972) Phys Rev Lett 29:705-708; Maiti et al. (1997)
Proc Natl Acad Sci USA 94:11753-11757). FCS measures the average
diffusion rate of a fluorescent molecule within a small sample volume. The
sample size can be as low as 10^3 fluorescent molecules and the sample
volume as low as the cytoplasm of a single bacterium. The diffusion rate is a
function of the mass of this molecule and decreases as the mass increases.
FCS can therefore be applied to protein-ligand interaction analysis by
measuring the change in mass and therefore in diffusion rate of a molecule
upon binding. In a typical experiment, the target to be analyzed is expressed
as a recombinant protein with a sequence tag, such as a poly-histidine
sequence, inserted at the N-terminus or C-terminus. The expression takes
place in E. coli, yeast or mammalian cells. The protein is purified by
chromatography. For example, the poly-histidine tag can be used to bind the
expressed protein to a metal chelate column such as Ni2+ chelated on
iminodiacetic acid agarose. The protein is then labeled with a fluorescent tag
such as carboxytetramethylrhodamine or BODIPY™ (Molecular Probes,
Eugene, Oregon). The protein is then exposed in solution to the potential
ligand, and its diffusion rate is determined by FCS using instrumentation
available from Carl Zeiss, Inc. (Thornwood, New York). Ligand binding is
determined by changes in the diffusion rate of the protein.
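
The mass dependence of the diffusion rate described above can be made concrete with a short sketch. It assumes, which the text does not state, that the labeled molecule and its complex behave as compact spheres of equal density, so that by the Stokes-Einstein relation the diffusion coefficient scales with the inverse cube root of mass. The function name and the example masses are illustrative only.

```python
def diffusion_ratio(mass_free, mass_bound):
    """Return D_bound / D_free under a compact-sphere model.

    Assumption: D is proportional to M ** (-1/3), which follows from the
    Stokes-Einstein relation when radius scales as the cube root of mass.
    Masses may be in any unit as long as both use the same one.
    """
    if mass_free <= 0 or mass_bound < mass_free:
        raise ValueError("expected 0 < mass_free <= mass_bound")
    return (mass_free / mass_bound) ** (1.0 / 3.0)


if __name__ == "__main__":
    # Hypothetical masses in daltons: a labeled species alone, and the
    # same species after binding a much larger partner.
    print(diffusion_ratio(20000.0, 160000.0))
    # The same species after binding a small 500 Da partner.
    print(diffusion_ratio(20000.0, 20500.0))
```

Under this model an eightfold increase in mass only halves the diffusion coefficient, so the measurable change is largest when the fluorescent species is the smaller binding partner.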

Surface-Enhanced Laser Desorption/Ionization (SELDI) was invented
by Hutchens and Yip (Hutchens and Yip (1993) Rapid Commun Mass Spectrom
7:576-580). When coupled to a time-of-flight mass spectrometer (TOF), SELDI
provides means to rapidly analyze molecules retained on a chip. It can be
applied to ligand-protein interaction analysis by covalently binding the target
protein on the chip and analyzing by MS the small molecules that bind to this
protein (Worrall et al. (1998) Anal Biochem. 70:750-756). In a typical
experiment, the target to be analyzed is expressed as described for FCS. The
purified protein is then used in the assay without further preparation. It is
bound to the SELDI chip either by utilizing the poly-histidine tag or by other
interaction such as ion exchange or hydrophobic interaction. The chip thus
prepared is then exposed to the potential ligand via, for example, a delivery
system able to pipet the ligands in a sequential manner (autosampler). The
chip is then submitted to washes of increasing stringency, for example a series
of washes with buffer solutions containing an increasing ionic strength. After
each wash, the bound material is analyzed by submitting the chip to
SELDI-TOF. Ligands that specifically bind the target are identified by the
stringency of the wash needed to elute them.

Biacore relies on changes in the refractive index at the surface layer
upon binding of a ligand to a protein immobilized on the layer. In this system,
a collection of small ligands is injected sequentially in a 2-5 microliter cell,
wherein the protein is immobilized within the cell. Binding is detected by
surface plasmon resonance (SPR) by recording laser light refracting from the
surface. In general, the refractive index change for a given change of mass
concentration at the surface layer is practically the same for all proteins and
peptides, allowing a single method to be applicable for any protein (Liedberg
et al. (1983) Sensors Actuators 4:299-304; Malmquist (1993) Nature 361:186-
187). In a typical experiment, the target to be analyzed is expressed as
described for FCS. The purified protein is then used in the assay without
further preparation. It is bound to the Biacore chip either by utilizing the poly-
histidine tag or by other interaction such as ion exchange or hydrophobic
interaction. The chip thus prepared is then exposed to the potential ligand via
the delivery system incorporated in the instruments sold by Biacore (Uppsala,
Sweden) to pipet the ligands in a sequential manner (autosampler). The SPR
signal on the chip is recorded and changes in the refractive index indicate an
interaction between the immobilized target and the ligand. Analysis of the
signal kinetics (on rate and off rate) allows the discrimination between
non-specific and specific interaction.

I.F. Transgenic Animals

It is also within the scope of the present invention to prepare a
transgenic animal to mutagenize the EP17 locus or to express a transgene
comprising nucleic acid sequences of the present invention. The term
"transgenic animal" indicates an animal comprising a germline insertion of a
heterologous nucleic acid. Transgenic animals of the present invention are
understood to encompass not only the end product of a transformation method,
but also transgenic progeny thereof.

The term "transgene", as used herein, indicates a heterologous nucleic
acid molecule that has been transformed into a host cell. For intended use in
the creation of a transgenic animal, the transgene includes genomic sequences
of the host organism at a selected locus or site of transgene integration to
mediate a homologous recombination event. A transgene further comprises
nucleic acid sequences of interest, for example a targeted modification of the
gene residing within the locus, a reporter gene, or an expression cassette, each
defined herein above.

Transgene integration can be used to create gene mutations, including
"knock-out", "knock-in", or "knock-down" mutations. The term "knock-out"
refers to a homologous recombination event that renders a gene inactive.
Gene knock-out is generally accomplished by integration of the transgene at
a chromosomal locus, thereby interrupting a gene residing at that locus. The
term "knock-in" refers to in vivo replacement at a targeted locus. Knock-in
mutations can modify a gene sequence to create a loss-of-function or
gain-of-function mutation. The term "gene knock-down" refers to a homologous
recombination event wherein the transgene partially eliminates gene function.
A knock-down animal can be created by transgenic expression of an antisense
molecule, wherein a transgene comprising the antisense sequence and a
relevant promoter are integrated into the genome at a non-essential locus.
Expression of the antisense or ribozyme molecule disrupts the corresponding
gene function, although this disruption is generally incomplete (Luyckx et al.
(1999) Proc Natl Acad Sci USA 96(21):12174-12179).

Conditional mutation can be accomplished [...] in combination with the
Cre-recombinase [...]. A transgenic mouse is derived that expresses
Cre-recombinase under the direction of an inducible promoter. A second
transgenic mouse bears a mutation of a gene of interest as well as a
lox-P-flanked endogenous gene sequence. Such transgenic mice are mated,
the resulting progeny having both the Cre-recombinase and lox-P-flanked
transgenes. Induction of Cre recombinase catalyzes excision of the
lox-P-flanked transgene, thereby excising a portion of the endogenous gene
sequence and revealing the mutated sequence. Conditional knockout can be
varied according to the temporal and spatial features of Cre recombinase
expression, inherent in the selection of a promoter to drive Cre recombinase.
See Postic et al. (1999) J Biol Chem 274(1):305-315; and Sauer (1998)
Methods 14(4):381-392.

Transgenes can also be used for heterologous expression in a host
organism without generating phenotypically apparent mutations. By this
method, nucleotide sequences of interest are introduced into the genome at a
nonessential locus, whereby insertion alone does not disrupt an essential gene
function.

Techniques for the preparation of transgenic animals are known in the
art. Exemplary techniques are described in U.S. Patent No. 5,489,742
(transgenic rats); U.S. Patent Nos. 4,736,866, 5,550,316, 5,614,396, 5,625,125
and 5,648,061 (transgenic mice); U.S. Patent No. 5,573,933 (transgenic pigs);
U.S. Patent No. 5,162,215 (transgenic avian species) and U.S. Patent No.
5,741,957 (transgenic bovine species). Briefly, nucleotide sequences of
interest are cloned into a vector (e.g., pLNK; Gorman et al., 1996), and the
construct is transformed into a germ cell. In the germ cell, a chromosomal
rearrangement event takes place wherein the nucleic acid sequences of
interest are integrated into the genome of the germ cell by homologous
recombination. Fertilization and propagation of the transformed germ cell
results in a transgenic animal. Homozygosity of the mutation is accomplished
by intercrossing.

I.G. Therapeutic Methods

The present invention further provides methods for discovering
substances that can be used as pharmaceutical compositions. The terms
"pharmaceutical composition" or "drug", as used herein, each refer to any
substance having a biological activity. Substances discovered by methods of
the present invention include but are not limited to polypeptides, proteins,
peptides, chemical compounds, and antibodies.

A composition of the present invention is typically formulated using
acceptable vehicles, adjuvants, and carriers as desired.

Among the acceptable vehicles and solvents that can be employed are
water, Ringer's solution, and isotonic sodium chloride solution. In addition,
sterile, fixed oils are conventionally employed as a solvent or suspending
medium. For this purpose any bland fixed oil can be employed including
synthetic mono- or di-glycerides. In addition, fatty acids such as oleic acid find
use in the preparation of injectable compositions.

Injectable preparations, for example sterile injectable aqueous or
oleaginous suspensions, are formulated according to the known art using
suitable dispersing or wetting agents and suspending agents. The sterile
injectable preparation can also be a sterile injectable solution or suspension in
a nontoxic diluent or solvent, for example, as a solution in 1,3-butanediol.

A vector, for example an adenovirus vector, can be used as a carrier
for gene therapy methods. The vector is sufficiently purified to render
it essentially free of undesirable contaminants, such as defective interfering
adenovirus particles or endotoxins and other pyrogens, such that it does not
cause any untoward reactions in the individual receiving the vector construct.
A preferred means of purifying the vector involves the use of buoyant density
gradients, such as cesium chloride gradient centrifugation.

A transfected cell can also serve as a carrier. By way of example, a
cell can be removed from an organism, transfected with a nucleic acid
sequence of the present invention using methods set forth above, and then the
transfected cell returned to the organism (e.g., injected intra-vascularly).

Monoclonal antibodies or polypeptides of the invention can be
administered parenterally by injection or by gradual infusion over time.
Although the tissue to be treated can typically be accessed in the body by
systemic administration and is therefore most often treated by intravenous
administration of therapeutic compositions, other tissues and delivery means
are provided where there is a likelihood that the tissue targeted contains the
target molecule and are known to those of skill in the art.

Representative antibodies for use in the present invention are intact
immunoglobulin molecules, substantially intact immunoglobulin molecules,
single chain immunoglobulins or antibodies, and those portions of an
immunoglobulin molecule that contain the paratope, including antibody
fragments. It is contemplated to be within the scope of the present invention
that a monovalent modulator can optionally be used.

Methods of preparing "humanized" antibodies are generally well known
in the art, and can readily be applied to the antibodies of the present invention.
Humanized monoclonal antibodies offer particular advantages over monoclonal
antibodies derived from other mammals, particularly insofar as they can be
used therapeutically in humans. Specifically, humanized antibodies are not
cleared from the circulation as rapidly as "foreign" antigens, and do not activate
the immune system in the same manner as foreign antigens and foreign
antibodies.

With respect to the therapeutic methods of the present invention, a
preferred subject is a vertebrate subject. A preferred vertebrate is
warm-blooded; a preferred warm-blooded vertebrate is a mammal. A preferred
mammal is a mouse or, most preferably, a human. As used herein and in the
claims, the term "patient" includes both human and animal patients. Thus,
veterinary therapeutic uses are provided in accordance with the present
invention.

Also provided is the treatment of mammals such as humans, as well as
those mammals of importance due to being endangered, such as Siberian
tigers; of economical importance, such as animals raised on farms for
consumption by humans; and/or animals of social importance to humans, such
as animals kept as pets or in zoos. Examples of such animals include but are
not limited to: carnivores such as cats and dogs; swine, including pigs, hogs,
and wild boars; ruminants and/or ungulates such as cattle, oxen, sheep,
giraffes, deer, goats, bison, and camels; and horses. Also provided is the
treatment of birds, including the treatment of those kinds of birds that are
endangered and/or kept in zoos, as well as fowl, and more particularly
domesticated fowl, i.e., poultry, such as turkeys, chickens, ducks, geese,
guinea fowl, and the like, as they are also of economical importance to
humans. Thus, provided is the treatment of livestock, including, but not limited
to, domesticated swine, ruminants, ungulates, horses, poultry, and the like.

As used herein, the term "experimental subject" refers to any subject or
sample in which the desired measurement is unknown. The term "control
subject" refers to any subject or sample in which a desired measure is
known.

As used herein, an "effective" dose refers to a dose or doses administered
to an individual patient sufficient to cause a change in EP17 activity. After
review of the disclosure herein of the present invention, one of ordinary skill in
the art can tailor the dosages to an individual patient, taking into account the
particular formulation and method of administration to be used with the
composition as well as patient height, weight, severity of symptoms, and stage
of the biological condition to be treated. Such adjustments or variations, as
well as evaluation of when and how to make such adjustments or variations,
are well known to those of ordinary skill in the art of medicine.

A therapeutically effective amount can comprise a range of amounts.
One skilled in the art can readily assess the potency and efficacy of an EP17
modulator of this invention and adjust the therapeutic regimen accordingly. A
modulator of EP17 biological activity can be evaluated by a variety of means
including the use of a responsive reporter gene, interaction of lipocalin
polypeptides with a [...] fertility assays, each technique
described herein.

Additional formulation and dose techniques have been described in the
art, see for example, those described in U.S. Patent Nos. 5,326,902 and
5,234,933, and PCT Publication WO 93/25521.

For the purposes described above, the identified substances can
normally be administered systemically, parenterally, or orally. The term
"parenteral" as used herein includes intravenous, intra-muscular, intra-arterial
injection, or infusion techniques. Other compositions for administration include
liquids for external use, and endermic liniments (ointment, etc.), suppositories,
and pessaries which comprise one or more of the active substance(s) and can
be prepared by known methods.

II. Cloning and Expression of the mouse EP17 Gene 

The present co-inventors have identified a new lipocalin, mEP17, that
is adjacent to a related lipocalin-encoding gene, mE-RABP, on mouse
chromosome 2 (Figure 1). The genomic organization of the mEP17 gene was
determined by prediction of exons within a BAC genomic clone and further
supported by cloning of the mEP17 cDNA (Lareyre et al., 2001). Figure 1
depicts the genomic organization of the mEP17 locus. Exon sizes are
indicated in nucleotides. The major transcription initiation sites of both genes
are represented with broken arrows. Primer FwMEP17cDNA (SEQ ID NO:7)
was used for primer extension analysis, as described herein below. The G-X-W
and T-D-Y motifs and two cysteine residues (C) are also indicated.

To isolate the mEP17 promoter region, two clones containing 6.3 kb
EcoRV restriction fragments within the 5' flanking region were isolated from the
genomic BAC clone 10983 (Figure 9). DNA sequencing analysis of both
clones revealed that the 6.3 kb fragment contains 5.4 kb of the 5' flanking
region of the mEP17 gene. Cloning methods are provided in Example 1.

The tissue distribution of mRNA encoding the mEP17 protein was
examined by Northern blot analysis of total RNA from twelve different tissues,
including spleen, liver, heart, lung, brain, kidney, testis, epididymis, vas
deferens, seminal vesicles, uterus, and ovary (Figure 2A). Hybridization of
Northern blots with a [32P]-radiolabeled mEP17 cDNA probe revealed two RNA
species of about 3.1 kb and 1 kb only in the epididymis. The total length of the
mEP17 gene, including exons and introns, is 3.1 kb. To determine whether the
3.1 kb RNA could be the precursor RNA, two epididymal RNA samples were
run side by side and hybridized individually with the cDNA probe or with a
probe encompassing intron 1 of the mEP17 gene (Figure 2B). The intron 1
probe hybridized with the 3.1 kb RNA but not the 1 kb RNA, indicating that
the 3.1 kb RNA is an unspliced precursor RNA.

To investigate the tissue-, region-, and cell-specific profile of gene
expression, in situ hybridization of mEP17 transcripts was carried out using
sense and antisense digoxygenin-labeled riboprobes generated from mEP17
cDNA (Figures 3 and 4A), as described in Example 2 below. mEP17 mRNA
was detected only in the principal cells of the initial segment of the epididymis
and is localized basally. Hybridization was not detected when the sense
riboprobe was used (Figure 4B). mEP17 gene expression was high throughout
the initial segment (IS). A checkerboard pattern was observed at the boundary
between the initial segment and the proximal caput epididymis, wherein some
cells expressed mEP17 and some cells did not. mEP17 mRNA was not
detected in the efferent ducts (ED), mid and distal caput (Cp), corpus and
cauda epididymis using sense or antisense probes.

III. Isolation of Human EP17

The present invention also provides a human EP17 gene. Preferably,
the human EP17 gene comprises the sequence set forth as SEQ ID NO:2, a
nucleic acid molecule that is substantially similar to SEQ ID NO:2, or a nucleic
acid molecule comprising a 20 base pair nucleotide sequence that is identical
to a contiguous 20 base pair sequence of SEQ ID NO:2.
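
The "identical contiguous 20 base pair sequence" criterion can be illustrated with a short sketch. The function name and the toy sequences below are hypothetical and are not taken from SEQ ID NO:2; the sketch also assumes that only the given strand is compared (no reverse complement) and that sequences are plain A/C/G/T strings.

```python
def shares_contiguous_run(candidate: str, reference: str, k: int = 20) -> bool:
    """Return True if any k-nucleotide window of `candidate` occurs verbatim in `reference`."""
    candidate = candidate.upper()
    reference = reference.upper()
    if len(candidate) < k or len(reference) < k:
        return False
    # Collect every k-length window of the reference once, then probe the candidate.
    reference_windows = {reference[i:i + k] for i in range(len(reference) - k + 1)}
    return any(candidate[i:i + k] in reference_windows
               for i in range(len(candidate) - k + 1))


# Toy sequences for illustration only (assumption: not real EP17 sequence).
toy_reference = "ATGGCCTTAGCGTACGATCGGATCCATTGCAAGCTTGGCA"
toy_match = "CCCC" + toy_reference[5:25] + "TTTT"   # embeds a 20-nt run of the reference
toy_mismatch = "ACGT" * 10                           # shares no 20-nt run with the reference

print(shares_contiguous_run(toy_match, toy_reference))
print(shares_contiguous_run(toy_mismatch, toy_reference))
```

A set of reference windows is used so that each candidate window is checked in constant time; for a real comparison the reference string would be the disclosed sequence itself.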

The mouse EP17 sequence was used to query databases of
genomic sequence, including GenBank and proprietary databases of Celera
Genomics Corp. (Rockville, Maryland). Two DNA fragments derived from
human chromosome 9 were identified (Accession numbers AL35598.7 and
449425.3) in GenBank, although [...] or the combination of the
sequences predicts the hEP17 gene. A genomic region having sequences
corresponding to the hEP17 gene was also identified in Celera's database
(Accession number GA 65 373998). The genomic sequence derived from
Celera's database was unannotated and did not predict the hEP17 gene.

The hEP17 gene disclosed herein (SEQ ID NO:2) was predicted by
comparing unannotated genomic sequence and the gene structure of mouse
EP17. Conserved intron/exon boundaries and conserved nucleotide sequence
were recognized and used to construct the gene map depicted in Figure 5 and
Table 1. Preferably, the human EP17 gene comprises a coding region and a
promoter region set forth as SEQ ID NOs:3 and 5, respectively.

Table 1

Feature       base pairs
              from       to
Exon 1        5160       5255
Intron 1      5256       12495
Exon 2        12496      12626
Intron 2      12627      13072
Exon 3        13073      13143
Intron 3      13144      14148
Exon 4        14149      14253
Intron 4      14254      14331
Exon 5        14332      14421
Intron 5      14422      14504
Exon 6        14505      14530
Intron 6      14531      15155
Exon 7        15156      15279
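
The coordinates in Table 1 can be turned into feature lengths with a few lines of arithmetic. The sketch below assumes the "from" and "to" positions are 1-based and inclusive, which is consistent with each intron starting one position after the preceding exon ends; the variable and function names are illustrative only.

```python
# Coordinates copied from Table 1 (assumed 1-based, inclusive).
FEATURES = [
    ("Exon 1", 5160, 5255),
    ("Intron 1", 5256, 12495),
    ("Exon 2", 12496, 12626),
    ("Intron 2", 12627, 13072),
    ("Exon 3", 13073, 13143),
    ("Intron 3", 13144, 14148),
    ("Exon 4", 14149, 14253),
    ("Intron 4", 14254, 14331),
    ("Exon 5", 14332, 14421),
    ("Intron 5", 14422, 14504),
    ("Exon 6", 14505, 14530),
    ("Intron 6", 14531, 15155),
    ("Exon 7", 15156, 15279),
]


def feature_length(start: int, end: int) -> int:
    """Length in base pairs of an inclusive interval."""
    return end - start + 1


lengths = {name: feature_length(start, end) for name, start, end in FEATURES}
exon_total = sum(n for name, n in lengths.items() if name.startswith("Exon"))
intron_total = sum(n for name, n in lengths.items() if name.startswith("Intron"))
gene_span = FEATURES[-1][2] - FEATURES[0][1] + 1

# Each feature should begin exactly one position after the previous one ends.
contiguous = all(FEATURES[i + 1][1] == FEATURES[i][2] + 1
                 for i in range(len(FEATURES) - 1))

for name, n in lengths.items():
    print(f"{name:<9} {n:>5} bp")
print(f"exons: {exon_total} bp, introns: {intron_total} bp, span: {gene_span} bp")
```

Under these assumptions the seven exons sum to 643 bp and the region from the start of exon 1 to the end of exon 7 spans 10120 bp, most of it contributed by intron 1.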



The predicted hEP17 gene displays sequence homology with other
lipocalins, most notably with mE-RABP and prostaglandin D isomerases
(Figure 6). The mouse and human EP17 proteins share 61% overall identity
and have conserved lipocalin motifs (G-X-W, T-D-Y, and two cysteine residues)
at similar positions (Figure 7).
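
A percent-identity figure of this kind is computed from a pairwise alignment. The sketch below shows one common convention, counting identical residues over alignment columns in which neither sequence has a gap; the source does not state which convention was used for the 61% value, and the toy aligned strings are hypothetical rather than EP17 sequence.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identical residues over ungapped columns of two pre-aligned sequences."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    identical = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == "-" or b == "-":
            continue  # skip columns containing a gap in either sequence
        compared += 1
        if a == b:
            identical += 1
    if compared == 0:
        return 0.0
    return 100.0 * identical / compared


# Toy alignment (assumption: illustrative residues only).
toy_a = "GSWTDYC-A"
toy_b = "GAWTDYCKA"
print(percent_identity(toy_a, toy_b))  # 7 identical of 8 ungapped columns -> 87.5
```

Other conventions divide by the full alignment length or by the shorter sequence length, so a reported identity is only comparable when the denominator is known.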

The amino terminal regions of both mouse and human EP17 proteins
are predicted to be a signal peptide since in each case the region is
hydrophobic, similar to the signal peptide of the mE-RABP protein, and in
agreement with the sliding window/matrix scoring method and the -1, -3 rule for
predicting a peptide cleavage (von Heijne (1986) Nucleic Acids Res 14:4683-
4690) (Figure 8). This observation implies that the human and mouse EP17
genes encode secreted proteins.

IV. Characterization of an EP17 Promoter Region

The transcription initiation sites of the mEP17 gene were determined by
primer extension using epididymal total RNA as a template and the EP17PE2
primer (SEQ ID NO:7) designed according to sequence in the first exon (Figure
10). Primer extension methods are described in Example 4 below. Two major
transcription initiation sites were localized 22 and 18 nucleotides from the
putative translation initiation site, and were numbered +1 and +5, respectively.
Two minor transcription initiation sites were also detected at position +2 and
+4. As shown in Figure 10, total RNA extracted from the epididymis (Ep) or
transfer (t) RNA was reverse transcribed with [32P]-radiolabeled EP17PE2
primer (SEQ ID NO:7) and extended using Avian Myeloblastosis Virus (AMV)
reverse transcriptase. Lanes labeled "C", "T", "A" and "G" are [35S]-
radiolabeled DNA sequencing reactions carried out using the EP17PE2 primer
(SEQ ID NO:7) and the pHindIII clone (shown in Figure 9) as template. The
localization of two major (arrows) and two minor (arrowheads) transcription
initiation sites are indicated.
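
The numbering of the initiation sites follows from simple arithmetic, sketched below. The sketch assumes that all four sites lie upstream of the putative translation initiation site and that positions +1 through +5 are consecutive nucleotides, so that the stated distances of 22 (for +1) and 18 (for +5) fix the distances of the minor sites; the function name is illustrative.

```python
# Distance, in nucleotides, from the +1 site to the putative translation
# initiation site, as stated in the text.
PLUS_ONE_DISTANCE = 22


def distance_to_translation_start(position: int) -> int:
    """Distance from a +N transcription initiation site to the translation initiation site."""
    if position < 1:
        raise ValueError("positions are numbered from +1")
    return PLUS_ONE_DISTANCE - (position - 1)


for label, position in (("major", 1), ("minor", 2), ("minor", 4), ("major", 5)):
    print(f"+{position} ({label}): {distance_to_translation_start(position)} nt")
```

The value for +5 reproduces the 18 nucleotides given in the text, and the minor sites at +2 and +4 fall 21 and 19 nucleotides from the translation initiation site under the same assumptions.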

The 5' flanking sequence closest to the transcription start site was
analyzed further. A 2.5-kb EcoRI restriction fragment comprising this sequence
was isolated from the genomic BAC clone 10983 (shown in Figure 9). A
computer analysis to identify putative cis-regulatory sites was carried out using
TFSEARCH version 1.3 (Yutaka Akiyama: "TFSEARCH: Searching
Transcription Factor Binding Sites", http://www.rwcp.or.jp/papia/) and
revealed the presence of several binding sites for known transcription factors,
including binding sites for androgen receptor (ARSB), retinoic acid receptor
(RARE), Stimulating Protein 1 (SP-1), Activator Protein 1 (AP-1), Activator
Protein 4 (AP-4), SRY (Sex-determining Region Y protein), C-Ets (cellular ets
oncogene), C/EBP (CCAAT/enhancer binding protein), and Sox-5 ([...]
sequence #5 protein). Putative cis-regulatory sites are underlined in Figure 11.
A consensus TATA box and CAAT-box are indicated. Major transcription
initiation sites are marked by long arrows, and minor transcription initiation sites
are marked by arrowheads.

To define functional sequences within the promoter region, several
chimeric reporter genes were constructed by ligation of various portions of the
5' flanking region of the EP17 gene and a reporter gene (Figure 12), as
described in Example 5 below. Each chimeric gene comprises a different
fragment of the 5.3 kb EP17 promoter region, an open reading frame encoding
a reporter gene operably linked to a promoter fragment, and the polyA tail
region of Simian Virus 40 large T antigen. Figure 12 indicates one preferred
reporter, chloramphenicol acetyltransferase (CAT). These constructs are
alternatively used for in vitro and in vivo assays of EP17 promoter region
function.

A preferable in vitro technique for evaluating EP17 promoter function is
a transient transfection assay. According to this method, each chimeric
reporter gene is introduced into a relevant host cell, and the resulting level of
reporter gene expression is quantitated. Preferred host cells include HeLa and
PC-3 cells, or normal or immortalized epididymal cells, described herein below.
In these experiments, luciferase is a preferable reporter gene in that it
demonstrates increased sensitivity of detection. Transient transfection assays
are performed as described in Example 6. Additional methods for making an
expression system comprising a promoter region operably linked to a
heterologous reporter sequence are disclosed in U.S. Patent No. 6,087,111.

To analyze the function of an EP17 promoter region in vivo, transgenic
mice bearing each chimeric gene are generated as described in Example 7
below, and a level of reporter gene expression in each mouse is determined.
For these experiments, [...]
endogenous activity in the epididymis. Several assays are performed to
characterize CAT expression in transgenic animals, including [...]
specific primers, CAT enzymatic assays, immunohistochemistry using an anti-
CAT antibody, and in situ hybridization using a CAT-specific probe. Methods
for performing these assays can be found in Lareyre, J. J., et al. (1999) J. Biol.
Chem. 274:8282-8290, in Lareyre et al. (2001) and Examples 2, 8, and 9.

A transgenic mouse bearing the entire 6.3 kb 5' flanking region of the
EP17 gene operably linked to the CAT gene shows CAT expression in the
caput epididymis, demonstrating that the 5.3 kb promoter region of the EP17
gene contains sequences required for the region-specific expression of the
EP17 gene. Shorter sequences of the EP17 promoter region can be used to
define a minimal sequence requisite for EP17 gene expression. In determining
a promoter region that reproduces endogenous EP17 expression, the
expression profile of each chimeric gene can be carefully compared to the
profile of EP17 gene expression as determined by in situ hybridization.

Within a candidate promoter region or response element, the presence
of regulatory proteins bound to a nucleic acid sequence can be detected using
a variety of methods well known to those skilled in the art (Ausubel et al.,
1992). Briefly, in vivo footprinting assays demonstrate protection of DNA
sequences from chemical and enzymatic modification within living or
permeabilized cells. Similarly, in vitro footprinting assays show protection of
DNA sequences from chemical or enzymatic modification using protein
extracts. Nitrocellulose filter-binding assays and gel electrophoresis mobility
shift assays (EMSAs) track the presence of radiolabeled regulatory DNA
elements based on provision of candidate transcription factors.

Genomic clones derived from GenBank and proprietary databases
(Celera Genomics Corp., Rockville, Maryland) were used to predict an hEP17
promoter region comprising an about 5150 base pair region immediately
upstream of the hEP17 transcription start site (Figure 13). This region is similar
to the promoter region of mEP17, having putative cis-DNA regulatory elements
including but not limited to a Sp-1 binding site, an AP-1 binding site, a cAMP
response element binding protein (CREB) binding site, a [...]
box gene 5 (Sox 5) binding site, a Sex-determining region Y gene product
(SRY) binding site, a c-Ets binding site, a GATA binding site, and an Octamer
transcription factor 1 (Oct-1) binding site (Figure 14). The hEP17 promoter is
further characterized in a manner as described herein above regarding the
mouse EP17 promoter region.

V. Methods for Identifying Regulators of Gene Expression

The nucleic acid sequences of the present invention can be used to
identify regulators of EP17 gene expression. Several molecular cloning
strategies can be used to identify substances that specifically bind EP17 cis-
regulatory elements. A preferred promoter region to be used in such assays
is an EP17 promoter region from mouse or human; more preferably the
promoter region includes some or all nucleotides of SEQ ID NO:1 or 5.
Figures 15A-15C present data indicating that mEP17 expression is not
regulated by hormones. However, studies in which spermatogenesis was
disrupted suggest that an EP17 lipocalin can be regulated by germ
cell-associated factors.

In one embodiment, a cDNA library in an expression vector, such as the
lambda-gt11 vector, can be screened for cDNA clones that encode an EP17
regulatory element DNA-binding activity by probing the library with a labeled
EP17 DNA fragment, or synthetic oligonucleotide (Singh et al. (1989)
Biotechniques 7:252-261). Preferably the nucleotide sequence selected as a
probe has already been demonstrated as a protein binding site using a protein-
DNA binding assay described above.

In another embodiment, transcriptional regulatory proteins are identified
using the yeast one-hybrid system (Luo et al. (1996) Biotechniques 20(4):564-
568; Vidal et al. (1996) Proc Natl Acad Sci USA 93(19):10315-10320; Li and
Herskowitz (1993) Science 262:1870-1874). In this case, a cis-regulatory
element of an EP17 gene is operably fused as an upstream activating
sequence (UAS) to one, or typically more, yeast reporter genes such as the
lacZ gene, the URA3 gene, the LEU2 gene, the HIS3 gene, or the LYS2 gene,
and the reporter gene fusion construct(s) is inserted into an appropriate yeast
host strain. It is expected that the reporter genes are not transcriptionally
active in the engineered yeast host strain [...] activator
protein to bind the UAS derived from the EP17 promoter region. The
engineered yeast host [...]
a yeast activation domain fusion protein expression vector, e.g. pGAD, where
the coding regions of the cDNA inserts are fused to a functional yeast
activation domain coding segment, such as those derived from the GAL4 or
VP16 activators. Transformed yeast cells that acquire a cDNA encoding a
protein that binds a cis-regulatory element of an EP17 gene can be identified
based on the concerted activation of the reporter genes, either by genetic
selection for prototrophy (e.g. LEU2, HIS3, or LYS2 reporters) or by screening
with chromogenic substrates (lacZ reporter) by methods known in the art.

In another embodiment, an in situ filter detection method is used to
clone a cDNA encoding the sequence-specific DNA-binding protein as
described in Example 10.

In a more preferred embodiment, one-hybrid analysis and in situ filter
detection methods are used sequentially. For example, an initial collection of
candidate transcription factors is identified by one-hybrid analysis, and this
initial collection is secondarily screened using in situ filter detection. This
combination of techniques provides a smaller but more confident pool of
candidate regulators than selected by either technique alone.
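
The logic of the sequential screen can be sketched as a two-stage filter. Everything in the sketch is hypothetical: the clone names, the boolean assay results, and the function name are placeholders chosen only to show that the second assay is applied solely to hits from the first, so the final pool is the set of candidates positive in both.

```python
from typing import Callable, Iterable, List


def sequential_screen(candidates: Iterable[str],
                      primary_assay: Callable[[str], bool],
                      secondary_assay: Callable[[str], bool]) -> List[str]:
    """Apply the primary assay first, then the secondary assay to primary hits only."""
    primary_hits = [c for c in candidates if primary_assay(c)]
    return [c for c in primary_hits if secondary_assay(c)]


# Hypothetical assay outcomes keyed by made-up clone names.
one_hybrid_positive = {"cloneA": True, "cloneB": True, "cloneC": False, "cloneD": True}
filter_binding_positive = {"cloneA": True, "cloneB": False, "cloneC": True, "cloneD": True}

confirmed = sequential_screen(
    one_hybrid_positive,
    primary_assay=lambda c: one_hybrid_positive[c],
    secondary_assay=lambda c: filter_binding_positive[c],
)
print(confirmed)
```

In this toy data three clones pass the first stage and two survive the second, which mirrors the "smaller but more confident pool" described above.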

A candidate regulator to be tested by these methods can be a purified
molecule, a homogenous sample, or a mixture of molecules or compounds.
More than one modulatable transcriptional regulatory sequence can be
screened simultaneously.

In accordance with the present invention there is also provided a rapid
and high throughput screening method that relies on the methods described
above. This screening method comprises separately contacting each
compound with a plurality of substantially identical samples. In such a
screening method the plurality of samples preferably comprises more than
about 10^4 samples, or more preferably comprises more than about 5 x 10^4
samples. In an alternative high-throughput strategy, each sample can be
contacted with a plurality of candidate compounds.

The present invention also provides an in vivo [...]
modulators of EP17 expression. In this case, a transgenic mouse is made
such that [...] an EP17 promoter and a reporter gene is
expressed and a level of reporter gene expression is assayable. Such
transgenic animals can be used for the identification [...]
therapies, and interventions that are effective in modulating EP17 expression.

VI. Method for Heterologous Gene Expression in Epididymis

The present invention enables epididymal expression of a heterologous
nucleic acid sequence. In this case, a transgenic animal is generated which
bears a transgene that includes an EP17 promoter region and a nucleotide
sequence of interest. A preferred EP17 promoter is the nucleotide sequence
of SEQ ID NO:1 or 5, more preferably a minimal functional portion of SEQ ID
NO:1 or 5 that drives appropriate epididymal expression, as determined by
methods described herein above.

In one embodiment, this method enables assay of the function of a gene
of interest in epididymis to the exclusion of other sites of gene function. For
example, the heterologous sequence can encode an antisense or ribozyme
nucleic acid molecule. When expressed in epididymis under the direction of an
EP17 promoter region, the function of a gene corresponding to the antisense
or ribozyme nucleic acid molecule is disrupted in epididymis but not other
tissues.

In another embodiment, an EP17 promoter drives expression of a toxin,
for example, thymidine kinase plus ganciclovir. Expression of the chimeric
gene targets degeneration of the initial segment of the epididymis. In this case,
the transgenic animal can be used as an animal model of infertility, described
further herein below.

In another embodiment, an EP17 promoter region drives expression of
a therapeutic gene or nucleotide sequence, as described herein below.

VII. Method for Producing an Epididymal Cell Line

Another aspect of the invention is a method for producing an epididymal
cell line. According to this method, a chimeric gene is constructed to express
a gene encoding a selectable marker under the control of an EP17 promoter
region, and the chimeric gene is used to create a [...]
the selectable marker in epithelial cells of the initial segment of the epididymis.
Preferably, the selectable marker confers a [...], and more
preferably, the selectable marker confers neomycin resistance, which can be
used even in selection of epididymal cells from non-epididymal cells in culture.
Also preferably, the EP17 promoter region used to perform this method is the
sequence of SEQ ID NO:1, or functional portion thereof. Similarly, using the
mE-RABP promoter, a neomycin-resistant immortalized cell line from the distal
caput can be generated by this method.

Also provided is a method for generating an immortalized epididymal cell
line. In this case, a transgenic animal is obtained, having a transgene that
encodes an oncogenic virus directed by a constitutive promoter. A preferred
oncogenic virus comprises a temperature-sensitive (ts) Simian virus 40 large
T antigen (Tegtmeyer (1975) J Virol 15(3):613-618). The ts-Simian virus 40
large T-antigen is completely inactive at non-permissive temperature (39°C),
partially inactive at body temperature, and substantially active at a permissive
temperature (33°C). Immortalized epithelial cells procured from a ts-Simian
virus 40 large T-antigen mouse are reproduced in culture. In one embodiment,
epididymal cells may be selected using the EP17 promoter operably linked to
the neomycin resistance gene. Since the EP17 promoter is expressed in the
initial segment, neo-selection will provide a pure population of epithelial cells
from that segment. A neomycin-resistant immortalized cell line from the distal
caput has been generated by this method using the E-RABP promoter and
maintained in culture for 12 months.

VIII. Method for Homologous Recombination at the EP17 locus

Another aspect of the invention is a method for mutagenizing the EP17
locus by homologous recombination. The method uses a targeting vector
having an isolated EP17 promoter region, a marker gene, and an isolated
EP17 3' flanking region. In a vector so constructed, the marker gene is
positioned between the promoter region and the 3' flanking region. In another
embodiment, the targeting vector further comprises a mutant EP17 coding
sequence, also positioned between the promoter region and the 3' flanking
region. The targeting vector is linearized by digestion with a restriction
endonuclease at a site other than within the promoter region, marker gene, 3'
flanking region, and [...]

In a preferred embodiment, the linearized vector is electroporated into
embryonic stem cells, and successful electroporation is assayed by detecting
the marker gene in the stem cells. Stem cells bearing the vector are used to
create a transgenic animal. According to the method, a homologous
recombination event is mediated at the EP17 locus, thereby exchanging native
EP17 gene sequences positioned between the promoter region and the 3'
flanking region with vector nucleotide sequences positioned the same.

The nucleic acids and methods of the present invention enable knock-
out, knock-in, and knock-down mutations of the EP17 gene. The phenotype
of EP17 mutant animals can be characterized to reveal EP17 function. The
expression of EP17 in epididymis, and the known functional importance of
regulated epididymal gene expression for male fertility, suggest that EP17 will
have a determinative role in sperm maturation. Methods for generating mEP17
mutant mice are provided in Example 11.

A preferred knock-out mutation removes part of the EP17 coding region
(exon 1) and can be generated using a targeting vector as depicted in Figure
16. Preferred knock-in mutations include mutation of any one of the amino acids
within the conserved lipocalin motifs to any amino acid that is a non-conservative
substitution. Other preferred knock-in mutations are targeted replacement of
one or both of the conserved cysteine residues with an amino acid(s) that is a
non-conservative substitution.

IX. Method for Detecting an EP17 Nucleic Acid Molecule

In another aspect of the invention, a method is provided for detecting a
nucleic acid molecule that encodes an EP17 polypeptide. According to the
method, a biological sample having nucleic acid material is procured and
hybridized under stringent hybridization conditions to an EP17 nucleic acid
molecule of the present invention. Such hybridization enables a nucleic acid
molecule of the biological sample and the EP17 nucleic acid molecule to form
a detectable duplex structure. Preferably, the EP17 nucleic acid molecule
includes some or all nucleotides of SEQ ID NO:1, 2, 3, or 5. Also preferably,
the biological sample comprises human nucleic acid material.

In another embodiment, genetic assays based on nucleic acid molecules
of the present invention can be used to screen for genetic variants by a number
of PCR-based techniques, including single-strand conformation polymorphism
(SSCP) analysis (Orita, M., et al. (1989) Proc Natl Acad Sci USA 86(8):2766-
2770), SSCP/heteroduplex analysis, enzyme mismatch cleavage, and direct
sequence analysis of amplified exons (Kestila et al. (1998) Mol Cell 1(4):575-
582; Yuan et al. (1999) Hum Mutat 14(5):440-446). Automated methods can
also be applied to large-scale characterization of single nucleotide
polymorphisms (Brookes (1999) Gene 234(2):177-186; Wang et al. (1998)
Science 280(5366):1077-82). The present invention further provides assays
to detect a mutation of a variant EP17 locus by methods such as allele-specific
hybridization (Stoneking et al. (1991) Am J Hum Genet 48(2):370-82), or
restriction analysis of amplified genomic DNA containing the specific mutation.

X. Recombinant Production of an EP17 Polypeptide

The present invention also provides a method for recombinant
production of an EP17 polypeptide, as described in Example 12. Preferably, the
recombinant polypeptide comprises some or all of the amino acid sequences
of SEQ ID NO:4 or 6.

Mouse EP17 protein was recombinantly produced using the pBAD/gIII
vector (Invitrogen of Carlsbad, California). To confirm the production of EP17
protein, total protein derived from transformed E. coli was resolved on a
polyacrylamide gel, and Coomassie blue staining revealed two enriched bands
of approximately 21 kDa and 23 kDa. Western blot analysis using an anti-his
tag antibody revealed the same two proteins, which correspond to the
processed and unprocessed EP17 isoforms, respectively (Figure 17).

Recombinantly produced proteins are useful for a variety of purposes,
including structural determination of an EP17 polypeptide, generation of an
antibody that recognizes an EP17 polypeptide, and screening assays to identify
a chemical compound or peptide that interacts with an EP17 polypeptide,
described further herein below.

XI. Production of EP17 Antibodies

In another aspect, the present invention provides a method of producing
an antibody immunoreactive with a lipocalin polypeptide, the method
comprising recombinantly or synthetically producing an EP17 polypeptide, or
portion thereof, to be used as an antigen. The EP17 polypeptide is formulated
so that it is used as an effective immunogen. An animal is immunized with the
formulated EP17 polypeptide, generating an immune response in the animal.
The immune response is characterized by the production of antibodies that can
be collected from the blood serum of the animal. Preferred embodiments of the
method use a polypeptide of SEQ ID NO:4 or 6.

The present invention also encompasses antibodies produced by this
method.

The foregoing antibodies can be used in methods known in the art
relating to the localization and activity of the EP17 polypeptide sequences of
the invention, e.g., for cloning of EP17 nucleic acids, immunopurification of
EP17 polypeptides, imaging EP17 polypeptides in a biological sample,
measuring levels thereof in appropriate biological samples, and in diagnostic
methods.

XII. Method for Detecting an EP17 Polypeptide

In another aspect of the invention, a method is provided for detecting a
level of EP17 polypeptide using an antibody that specifically recognizes an
EP17 polypeptide, or portion thereof. In a preferred embodiment, biological
samples from an experimental subject and a control subject are obtained, and
EP17 polypeptide is detected in each sample by immunochemical reaction with
the EP17 antibody. More preferably, the antibody recognizes amino acids of
SEQ ID NO:4 or 6 and is prepared according to a method of the present
invention for producing such an antibody.

In one embodiment, an EP17 antibody is used to screen a biological
sample for the presence of a lipocalin polypeptide. A biological sample to be
screened can be a biological fluid such as extracellular or intracellular fluid, or
a cell or tissue extract or homogenate. A biological sample can also be an
isolated cell (e.g., in culture) or a collection of cells such as in a tissue sample
or histology sample. A tissue sample can be suspended in a liquid medium or
fixed onto a solid support such as a microscope slide. In accordance with a
screening assay method, a biological sample is exposed to an antibody
immunoreactive with an EP17 polypeptide whose presence is being assayed,
and the formation of antibody-polypeptide complexes is detected. Techniques
for detecting such antibody-antigen conjugates or complexes are well known
in the art and include but are not limited to centrifugation, affinity
chromatography and the like, and binding of a labeled secondary antibody to
the antibody-candidate receptor complex.

XIII. Screening for Small Molecule Ligands that Interact with EP17

The present invention further discloses a method for identifying a
compound that modulates EP17 function. According to the method, an EP17
polypeptide is exposed to a plurality of compounds, and binding of a compound
to the isolated EP17 polypeptide is assayed. A compound is selected that
demonstrates specific binding to the isolated EP17 polypeptide. Preferably, the
EP17 polypeptide used in the binding assay of the method includes some or
all amino acids of SEQ ID NO:4 or 6.

Several techniques can be used to detect interactions between a protein
and a chemical ligand without employing an in vivo ligand. Representative
methods include, but are not limited to, fluorescence correlation spectroscopy,
surface-enhanced laser desorption/ionization, and Biacore technology, as
described in Example 13. These methods are amenable to automated, high-
throughput screening.

Candidate regulators include but are not limited to proteins, peptides,
and chemical compounds. Structural analysis of these selectants can provide
information about ligand-target molecule interactions that enable the
development of pharmaceuticals based on these lead structures.

Similarly, the knowledge of the structure of a native EP17 polypeptide
provides an approach for rational drug design. The structure of an EP17
polypeptide can be determined by X-ray crystallography or by computational
algorithms that generate three-dimensional [...]
(2000) Pac Symp Biocomput 230-41; Saqi et al. (1999) [...]
522. Computer models can further predict [...]
various substrate molecules, that can be synthesized and tested. Additional
drug design techniques are described in U.S. Patent Nos. 5,834,228 and
5,872,011.

XIV. Animal Model of Male Infertility

The present invention further pertains to an animal model of male 
infertility. Such a model is prepared by several methods. 

Using a transgenic approach, knock-out, knock-in, or knock-down
mutation of the EP17 gene can suppress fertility. In another embodiment,
expression of a toxin, for example, thymidine kinase plus ganciclovir, under the
direction of an EP17 promoter targets degeneration of the initial segment of the
epididymis and thereby compromises fertility.

The present invention also teaches that an animal model of fertility is
prepared by immunizing an animal with an EP17 polypeptide. The resulting
immune response in the animal comprises a production of antibodies that
specifically bind an EP17 polypeptide, thereby disrupting its biological activity.

A method is also provided for generating an animal model of infertility
by administering to an animal a compound that disrupts EP17 expression or
function. Such a compound is discovered by methods disclosed herein.

Animal models of male infertility can be characterized according to
several measures, including in vivo and in vitro assays of fertility, as described
in Examples 14 and 15 below, and morphological inspection of the epididymis.

XV. Therapeutic Applications 

Another aspect of the present invention is a therapeutic method
comprising administering to a subject a substance that modulates lipocalin
biological activity. Therapeutic substances include but are not limited to
chemical compounds, antibodies, and gene therapy vectors.

Compounds that are discovered by the methods disclosed herein are
useful for therapeutic applications related to male fertility. For example, a
compound that mimics EP17 function, when administered to an infertile male
subject, can regulate fertility by promoting spermatozoa maturation in the
epididymis. Conversely, a compound that interferes with EP17 function can act
to suppress spermatozoa maturation.

The present invention also provides a method for disrupting EP17
function by immunizing a subject with an effective dose of the disclosed EP17
polypeptide. The immune system of the subject produces an antibody that
specifically recognizes the EP17 polypeptide, and binding of the antibody to
the EP17 polypeptide abolishes EP17 function. In a preferred embodiment, the
antibody recognizes some or all of the amino acids of SEQ ID NO:4 or 6 and
is prepared according to a method of the present invention for producing such
an antibody.

Several studies have demonstrated the utility of immunotherapeutic
approaches to contraception and teach methods for preparing and
administering such vaccines, including U.S. Patent Nos. 6,132,720 and
6,096,318, Feng et al. (1999), and Naz (1999). U.S. Patent No. 6,096,318
additionally discloses methods for chemical modification of immunogenic
proteins, and fragments thereof, which elicit an amplified immune response in
a subject receiving an injection of the modified polypeptide. Briefly, the antigen
modification is accomplished by attaching the protein to a carrier such as a
bacterial toxin or by polymerization of protein fragments. This method has
been used to modify human chorionic gonadotropin, an antigen that is effective
for immunological contraception in mammals.

The present invention further provides lipocalin nucleic acid sequences
and gene therapy methods for modulating lipocalin activity in a target cell. The
gene therapy vector can encode an EP17 lipocalin, preferably comprising the
amino acid sequences of SEQ ID NO:4 or 6. Alternatively, a gene therapy
vector can include sequences encoding a nucleic acid molecule, peptide, or
protein that interacts with an EP17 lipocalin. This modulation can affect
spermatozoa maturation in the vicinity of a lipocalin-secreting cell. Additionally,
a gene therapy vector can include an EP17 promoter sequence of the present
invention to provide tissue specific expression of a gene of interest in a subject.
Preferably, the EP17 promoter region used to perform this method is the
nucleotide sequence of SEQ ID NO:1 or 5, or functional portion thereof.

Vehicles for delivery of a gene therapy vector include but are not limited
to a liposome, a cell, and [...] Preferably, a cell is transformed or
transfected with the DNA molecule [...]
transfected cell. An exemplary and preferred transformed or transfected cell
is an epididymal cell. Alternatively, the vehicle is a virus, including a retroviral
vector, adenoviral vector or vaccinia virus whose genome has been
manipulated in alternative ways so as to render the virus non-pathogenic.
Methods for creating such a viral mutation are detailed in U.S. Pat. No.
4,769,331. Exemplary gene therapy methods are described in U.S. Patent
Nos. 5,279,833; 5,286,634; 5,399,346; 5,646,008; 5,651,964; 5,641,484; and
5,643,567.

The ability for adenovirus gene therapy vectors to infect male germ cells
to the exclusion of embryos fertilized by infected sperm was demonstrated by
Hall et al. (2000) Hum Gene Ther 11(12):1705-1712. High titers of the vector
were injected directly into mouse testis and epididymis, or alternatively, sperm
were exposed to the virus immediately prior to or during in vitro fertilization.
The vector carried the bacterial lacZ gene under the direction of the Rous
sarcoma virus promoter, and infection was assayed by enzymatic or
immunologic detection of β-galactosidase. lacZ expression was assayed
during the several weeks following injection, and in preimplantation embryos
produced by in vitro fertilization with sperm exposed to gene therapy vector.
lacZ expression was observed in sperm but not in embryos, supporting a
conclusion that adenovirus vectors pose minimal risk for germ line integration
when exposed to male reproductive cells. These studies teach methods for
construction of a gene therapy vector and effective administration of a vector
in the male reproductive system, as proposed for administration of nucleic
acids of the present invention.

The invention further provides a method for diminishing the fertile
capacity of a subject. According to the method, a chemical compound, peptide,
or antibody that interacts with an EP17 polypeptide, preferably the polypeptide
of SEQ ID NO:4 or 6, is identified. A pharmaceutical preparation is prepared
comprising such a chemical compound, pep[...] An
effective dose of the pharmaceutical composition is administered to a subject,
[...]

The invention further provides a method for promoting the fertile capacity
of a subject. In this case, a chemical compound or peptide that interacts with
an EP17 polypeptide, preferably the polypeptide of SEQ ID NO:4 or 6, is
identified. A pharmaceutical composition comprising the chemical compound
or peptide and a carrier is prepared. An effective dose of the pharmaceutical
composition is administered to a subject, whereby the fertile capacity of the
subject is improved.

XVI. Summary

Summarily, the provisions of an EP17 promoter region, a 3' flanking
genomic region of mEP17, a coding sequence of hEP17, and a hEP17
polypeptide are a significant advance in fertility-related research. The
disclosed EP17 nucleic acids and polypeptides can be used according to
methods of the present invention to generate a mouse model of male infertility,
for drug discovery screens, and for therapeutic treatment of fertility-related
conditions.

EXAMPLES 

The following Examples have been included to illustrate modes of the
invention. Certain aspects of the following Examples are described in terms of
techniques and procedures found or contemplated by the present co-inventors
to work well in the practice of the invention. These Examples illustrate
standard laboratory practices of the co-inventors. In light of the present
disclosure and the general level of skill in the art, those of skill will appreciate
that the following Examples are intended to be exemplary only and that
numerous changes, modifications and alterations can be employed without
departing from the scope of the invention.

Example 1 
Cloning of Mouse EP17 Promoter Region

Two clones containing 6.3 kb EcoRV restriction fragments were isolated
from the genomic BAC clone 10983 (Figure 9). The DNA fragments were
subcloned in pBluescript SK+ (Promega, M[...]) using
appropriate enzymes. DNA templates for sequencing were purified using the
"Plasmid Midi kit" (Qiagen Inc., Santa Clari[...]) [...]
were performed as described in the Thermo Sequenase™ fluorescent
labeled-primer cycle sequencing kit (Perkin-Elmer, Foster City, California). DNA
fragments were separated in a denaturing PAGE (6% acry[...]) [...]
analyzed using an ABI 373A automated sequencer (PE, Applied Biosystems,
Foster City, California). Nucleotide sequences were analyzed using the
GeneJockey™ software available from Biosoft of Ferguson, Missouri. DNA
sequencing analysis of both clones revealed that the 6.3 kb fragment contains
5.4 kb 5' flanking region of the mEP17 gene.

Example 2 
In situ Hybridization
Nonisotopic in situ hybridization was performed on 4-6 μm thick
cryosections of fresh-frozen mouse epididymis. Sections were fixed in 4%
formaldehyde in 0.1 M sodium phosphate buffer pH 7.2 and then incubated for
10 minutes in PBS containing 5 μg/ml proteinase K. See Sambrook et al.
(1992) for a description of PBS. After two rinses in PBS, sections were
incubated in 0.25% acetic anhydride in 0.1 M triethanolamine pH 8.0 for 15
minutes. Sense and antisense riboprobes were prepared in 20 μl transcription
reactions containing SP6 (Promega, Madison, Wisconsin) or T7 (New England
Biolabs, Beverly, Massachusetts) polymerase, 1X transcription buffer, 1 mM
each of ATP, CTP, and GTP, 0.65 mM UTP, 0.35 mM digoxygenin-UTP
(Roche Diagnostics Corp., Indianapolis, Indiana), and 1 μg linearized F3
plasmid carrying the mEP17 cDNA. Unincorporated nucleotides were removed
on a Chroma Spin-100 STE column (Clontech, Palo Alto, California).

Labeled riboprobes were denatured for 5 minutes at 80°C, diluted in
hybridization buffer composed of 50% (vol/vol) formamide, 10% (wt/vol) dextran
sulfate, 4X SSC, 1X Denhardt's reagent, 0.5 mg/ml yeast tRNA, and incubated
with the sections overnight at 55°C. See Sambrook et al., 1992 for a description
of SSC buffer. The slides were washed at room temperature for 5 minutes in
2X SSC, rinsed in STE buffer (500 mM NaCl, 20 mM Tris-HCl pH 7.[...], 1 mM
EDTA), and then incubated for 30 minutes in STE containing 40 μg/ml RNase
A. The sections were washed sequentially for 5 minutes each in 2X SSC, 50%
formamide at 50°C, then at room temperature [...]
0.5X SSC.

To detect hybridized probe[...]
Tris-HCl pH 7.5, 150 mM NaCl), blocked for 1 hour in blocking solution (TN
buffer containing 2% horse serum and 0.1% Triton X-100), and incubated for
1 hour in 1:500 diluted alkaline phosphatase conjugated antidigoxygenin
(Roche Diagnostics Corp.) in blocking solution. Slides were rinsed three times
in blocking solution and then in a substrate buffer of 100 mM Tris-HCl pH 9.5,
100 mM NaCl, 50 mM MgCl2. Color development was in substrate buffer
containing 0.17 mM 5-bromo-4-chloro-3-indolyl phosphate, 10 mM
N-ethyl-maleimide, and 1 mM levamisole as an inhibitor of endogenous alkaline
phosphatase. Color development was stopped with 10 mM Tris-HCl pH 8.0
and 1 mM EDTA. Sections were examined and photographed with a Zeiss
Axiophot using both bright field and phase contrast optics.

Example 3
Cloning Human EP17
To recover a full-length human EP17 gene, a hybridization screen is
performed using a human epididymal genomic library probed with the
nucleotide sequence of SEQ ID NO:2, 3 or 5, or portion thereof. Positive
colonies are selected, a subset sequenced, and a clone corresponding to the
full-length cDNA is recovered. Alternatively, primers from the predicted 5' and
3' ends of SEQ ID NO:2 are used in polymerase chain reaction with a human
epididymal genomic DNA as template to amplify a fragment representing the
full-length clone.

To recover a full-length human EP17 cDNA, a hybridization screen is
performed using a human epididymal cDNA library probed with the nucleotide
sequence of SEQ ID NO:2 or 3, or portion thereof. Positive colonies are
selected, a subset sequenced, and a clone corresponding to the full-length
cDNA is recovered. Alternatively, primers from the predicted 5' and 3' ends of
SEQ ID NO:3 are used in polymerase chain reaction with a human epididymal
cDNA as template to amplify a fragment representing the full-length clone.

Example 4 

Primer Extension Analysis to Identify Transcription Initiation Sites
Total RNA was extracted from the mouse epididymis using a method
described previously (Chomczynski and Sacchi (1987) Anal Biochem
162:156-159). The EP17PE2 primer (SEQ ID NO:7) specific for mEP17 mRNA was
radiolabeled using T4 nucleic acid sequence kinase in the presence of 100 μCi
[γ-32P]-ATP (3000 Ci/mmol) (Amersham) according to the manufacturer's
instructions (New England Biolabs). For each reaction, 10 μg of epididymal
total RNA or transfer RNA was hybridized to 1 pmol (10^ dpm) of EP17PE2
primer for 12 hours at 35°C in 10 μl of a solution containing 0.04 M
[1,4]-piperazine diethanesulfonic acid (PIPES), pH 6.4, 1 μM EDTA, and 80%
(vol/vol) formamide.

Reverse transcription was performed in 20 μl containing 50 [...] Tris-HCl
pH 8.3, 30 [...]M KCl, 8 μM MgCl2, 6 μM DTT, 0.5 mM of each dNTP, and 50
units Avian Myeloblastosis Virus (AMV) reverse transcriptase (Promega,
Madison, Wisconsin). Samples were incubated for 30 minutes at 42°C, and
then 50 units of AMV reverse transcriptase were added again and incubated
for 1 hour more. Elongated radiolabeled fragments were loaded on a
denaturing PAGE (7% polyacrylamide gel) next to sequencing reactions carried
out using the Sequenase sequencing kit (Amersham, USB). The clone pHindIII
(shown in Figure 9) and the EP17PE2 primer (SEQ ID NO:7) were used as
template and primer, respectively.

Example 5 
Chimeric Gene Expression Constructs 
DNA fragments derived from the BAC clone 10983 were generated
using appropriate restriction enzymes. DNA fragments were resolved on an
agarose gel, purified from the agarose, and ligated into the promoterless
pBLCAT3 plasmid (Luckow, 1987) by standard methods. This construction
enabled expression of the CAT gene by a mEP17 promoter region fragment.

Example 6
Transient Transfection Assays
Unless otherwise indicated, all media, cell culture and transfection
reagents, w[...]
Maryland). [...] cells were cultured in F12K Nutrient Mixture
(Kaighn's Modification): Dulbecco's Modified Eagle Medium (DMEM)
supplemented with 5[...] 50 μg/ml streptomycin and 10% (v/v)
charcoal/dextran treated fetal bovine serum (FBS, HyClone, Logan, UT). Both
cultures were maintained at 37°C in humidified air with 5% CO2. Plasmids were
prepared with the QIAGEN™ plasmid kit. Lipofectin reagent and PLUS reagent
(Gibco BRL) were used according to the manufacturer's protocol. Briefly, cells
were plated at 2 X 10^ cells/well in 6-well plates the day before transfection.
After 24 hours, 5 μl of PLUS reagent, 0.5 μg of chimeric construct, 0.5 μg of
androgen or glucocorticoid expression vector and 0.05 μg of pRL-CMV were
diluted in 100 μl of DMEM and incubated for 15 minutes at room temperature.
The two solutions were combined, gently mixed, and incubated for 15 minutes
at room temperature. Four or eight μl of Lipofectin reagent was diluted in
DMEM and incubated for 15 minutes at room temperature. The two solutions
were combined, gently mixed, and incubated for 15 minutes at room
temperature. While complexes were forming, medium was replaced with 800
μl of fresh DMEM. Following incubation, the transfection mixtures were added
to the wells. Cells were incubated for 4 hours at 37°C at 5% CO2. After
incubation, medium was replaced with 2 ml of DMEM containing 10% FBS and
appropriate hormones. After 24 hours, cells were washed once with phosphate
buffered saline, 500 μl of passive lysis buffer (Promega) were added and cells
were incubated for 15 minutes at room temperature in a shaker. The cell
lysates were transferred to fresh tubes, centrifuged at 12,000 rpm for 30
seconds to remove debris and stored at -80°C. For efficiency control, Renilla
luciferase activity (pRL-CMV) was monitored.
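
The text states only that Renilla luciferase activity from pRL-CMV was monitored as an efficiency control. One common way to use such a control, sketched below as an assumption rather than as the co-inventors' stated procedure, is to divide each well's reporter activity by its Renilla activity before comparing wells. The function names and the readings are hypothetical.

```python
def normalize_to_renilla(reporter_activity, renilla_activity):
    """Return reporter activity per unit of Renilla luciferase activity."""
    if renilla_activity <= 0:
        raise ValueError("Renilla activity must be positive to normalize")
    return reporter_activity / renilla_activity


def fold_change(treated_well, control_well):
    """Ratio of normalized reporter activity, treated over control.

    Each argument is a (reporter_activity, renilla_activity) pair.
    """
    return normalize_to_renilla(*treated_well) / normalize_to_renilla(*control_well)


# Hypothetical readings in arbitrary units, for illustration only.
hormone_well = (1200.0, 300.0)  # normalized value: 4.0
vehicle_well = (400.0, 200.0)   # normalized value: 2.0
print(fold_change(hormone_well, vehicle_well))
```

Normalizing within each well before taking ratios keeps differences in transfection efficiency from being read as differences in promoter activity.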

Example 7 
Transgenic Mice 

The chimeric gene comprising the 5.3 kb EP17 promoter region
fragment and the CAT reporter gene was excised from the pUC18 vector by
restriction enzyme digest. DNA fragments were purified on a 0.8% (w/v)
agarose gel using the AgarACE™ enzyme (Promega). Transgenic mice (strain
B6D2; Harlan Spragu[...]
into the male pronucleus of a fertilized oocyte using standard techniques
(Palmiter and Brinster (1985) Cell 41:343-345). Seven independent transgenic
lines carrying the CAT reporter gene [...] Caput epididymis-specific
CAT activity was detected in three transgenic mouse lines. CAT expression
was restricted to the initial segment of the caput epididymis as observed for the
mEP17 gene. Thus, the 5.3 kb fragment of the mEP17 5' flanking region is
sufficient for region-specific expression and can be used for heterologous
expression in the initial segment of the caput epididymis.

Example 8
PCR Assay of Transgenic Mice
Transgenic animals were identified by PCR-based screening using DNA
isolated from the tail of each animal. Approximately 1 cm of the tail was
digested overnight at 55°C in a Proteinase K digestion mix (10 mM Tris-Cl, pH
7.5, 75 mM NaCl, 25 mM EDTA, 1% SDS, 0.5 mg/ml Proteinase K). DNA was
extracted with one volume of phenol/chloroform/isoamyl alcohol (25/24/1) and
precipitated at room temperature with two volumes of absolute ethanol.
Samples were centrifuged at 10,000 x g at 4°C for 15 minutes, washed with
70% ethanol, centrifuged at 10,000 x g at 4°C for 15 minutes, and dried for 2
hours at room temperature.

500 ng of genomic DNA were mixed with 1X PCR buffer II (Perkin
Elmer), 2 units of Taq DNA polymerase (Promega), 1.5 mM MgCl2, 1 μM
concentration of each primer (primer 1, SEQ ID NO:8; primer 2, SEQ ID NO:9;
casein forward primer, SEQ ID NO:10; casein reverse primer, SEQ ID NO:11),
and 0.2 mM dNTP. DNA fragments were amplified for 30 cycles (95°C, 1
minute; 50°C, 45 seconds; 72°C, 45 seconds) and 1 cycle (95°C, 1 minute; 50°C,
45 seconds; 72°C, 10 minutes). PCR products were analyzed on a 2% (w/v)
agarose gel.
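
As a planning aid, the cycling program above can be written out as data and its programmed hold time totaled. The sketch below is illustrative only: the variable and function names are hypothetical, and the total excludes temperature ramp times, which depend on the thermal cycler.

```python
# Each step is (temperature in degrees C, hold time in seconds),
# transcribed from the cycling conditions given in Example 8.
AMPLIFICATION_CYCLE = [(95, 60), (50, 45), (72, 45)]
FINAL_CYCLE = [(95, 60), (50, 45), (72, 600)]


def hold_seconds(cycle, repeats):
    """Total programmed hold time for `repeats` passes through `cycle`."""
    return repeats * sum(seconds for _temperature, seconds in cycle)


total_seconds = hold_seconds(AMPLIFICATION_CYCLE, 30) + hold_seconds(FINAL_CYCLE, 1)
print(total_seconds, "seconds, or", total_seconds / 60, "minutes")
```

Under these conditions the programmed holds add up to 5205 seconds (86.75 minutes): 30 cycles of 150 seconds plus a final cycle of 705 seconds.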

Example 9 

CAT Enzymatic Assay in Transgenic Mice

To monitor CAT activity, organs were dissected from a transgenic animal
and homogenized by 20 strokes with a B pestle in a glass Dounce
homogenizer in 200 μl of 0.1 M Tris-HCl, 0.1% Triton X-100, pH 7.8. Insoluble
material was removed by centrifugation [...] for 5 minutes). CAT
assays were performed by the two-phase fluor diffusion method a[...]
previously (Nachtigal et al. (1989) Nuc Acid Res 17:4327-4337). Briefly, cell
lysate (5[...] to 200 μg) is added to a scintillation vial with a lysis buffer to give a
total volume of 200 μl. The solution is heated to 65°C for 10 minutes, cooled
to room temperature, and a reaction mix (75 μl), containing 2 μl 3H-acetyl CoA
(Amersham Pharmacia Biotech), 50 μl of 5 mM chloramphenicol (in water), 7.5
μl of 1 M Tris-HCl (pH 7.8) and 15.5 μl of water, was added. The reaction
mixture was carefully overlaid with 3 ml of organic phase scintillation cocktail.
After 30 minutes, the samples were counted for at least 5 minutes.
Quantitative values for CAT activity were determined by regression analysis to
give counts per minute, per mg (cpm/min/mg) of protein cell lysate.
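
The regression step can be illustrated with a short calculation. The sketch below assumes, as is usual for fluor diffusion assays but not spelled out in the text, that each vial is counted repeatedly, that counts rise linearly over the measured interval, and that activity is the least-squares slope of cpm against time divided by the protein added. All names and numbers are hypothetical.

```python
def least_squares_slope(xs, ys):
    """Ordinary least-squares slope of ys regressed on xs."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need at least two paired observations")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    sxx = sum((x - mean_x) ** 2 for x in xs)
    if sxx == 0:
        raise ValueError("time points must not all be identical")
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    return sxy / sxx


def cat_activity(times_min, counts_cpm, protein_mg):
    """CAT activity as cpm per minute per mg of lysate protein."""
    return least_squares_slope(times_min, counts_cpm) / protein_mg


# Hypothetical time course for one vial holding 0.5 mg of lysate protein.
times_min = [0, 10, 20, 30]
counts_cpm = [100, 300, 500, 700]
print(cat_activity(times_min, counts_cpm, protein_mg=0.5))
```

Using the slope rather than a single end-point count makes the value independent of the background counts present at time zero.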

Example 10 

In Situ Filter Detection

About 10^ λgt11 clones of a cDNA expression library are prepared from
RNA containing poly(A)+ RNA of the mouse distal caput epididymis. Clones are
plated and replicated on nitrocellulose filters. After denaturation and
renaturation, the filter-bound proteins are screened with a concatenated
oligonucleotide probe containing the nucleotide sequence of the cis-DNA
regulatory element. The probe is prepared by nick translation with a specific
activity of >10^ [...]/μg. Duplicate screening using a probe carrying a mutated
cis-DNA regulatory element is carried out to eliminate false positive clones.

Example 11

Mouse EP17 Knock-out
To recover genomic DNA sequence necessary for homologous
recombination, a 129/SvEv mouse genomic DNA library was screened using
mE-RABP cDNA as a probe. BAC clone 170K23 was isolated, having 5.3 kb
flanking region and all exons of the mEP17 gene. The targeting vector
comprises a 5.3 kb EcoRV-SalI fragment of the 5' promoter region and a 1.9
kb 3' flanking region (Figure [...]). The entire mEP17 coding region is replaced
with a PGK-neomycin cassette [...]
so that the PGK-neomycin cassette is positioned between the 5' promoter
region and the 3' flanking region. The targeting vector is linearized using [...]
appropriate restriction enzyme, and the linearized vector is electroporated into
TL1 embryonic stem (ES) cells. ES cells are selected based on demonstrated
resistance to geneticin after 24 hours. Resistant cells are further screened by
Southern blot analysis using a probe designed according to sequence of the
targeting vector.

Clones bearing the transgene are injected into blastocysts according to
standard procedures (Joyner (1993) "Gene Targeting: A Practical Approach"
IRL Press, Oxford). Chimeric mice bearing the transgene are crossed with
C57BL/6 females and agouti offspring are analyzed by PCR and Southern blot
analysis for presence of the targeted allele. mEP17 homozygous mutant mice
are obtained by crossing heterozygous mice having one native allele and a
knock-out allele. mEP17 homozygous mutant mice are confirmed as such by
demonstrating a loss of mEP17 expression by standard methods, including
Northern blot analysis, RNAse protection assays, Western blot analysis, and
immunohistochemistry.

Example 12 

Recombinant Production of EP17 Protein

The mature protein coding sequence was cloned into the prokaryotic
expression vector pBAD/gIII (Invitrogen). To simplify purification, the
pBAD/gIII vector encodes a leader peptide which directs the recombinant
protein into bacterial periplasmic space, thereby minimizing any potential toxic
effect. The pBAD/gIII vector also encodes a C-terminal polyhistidine tag for
detection with an anti-His antibody and for purification with ProBond resin
(Invitrogen). The pBAD/gIII vector carrying the mEP17 coding sequence was
transformed into E. coli according to the manufacturer's conditions.
Transformed E. coli were cultured and recombinant protein was extracted. To
confirm the production of mEP17 protein, protein derived from transformed
E. coli was resolved on a polyacrylamide gel and Western blot analysis was
performed according to standard techniques.

Example 13

In vitro Binding Assays

Recombinant protein is obtained, for example, according to the approach
described in Example 12 herein above. The protein is immobilized on chips
appropriate for ligand binding assays. The protein immobilized on the chip is
exposed to sample compound in solution according to methods well known in
the art. While the sample compound is in contact with the immobilized protein,
measurements capable of detecting protein-ligand interactions are conducted.
Measurement techniques include, but are not limited to, SELDI, Biacore, and
FCS, as described above. Compounds found to bind the protein are readily
discovered in this approach and are subjected to further characterization.

Example 14 
In vivo Fertility Assay 
Five wild type heterozygous transgenic and five mutant homozygous
transgenic male mice are individually mated with five wild type C57BL/6
females during an overnight interval. Females exhibiting a vaginal plug the
next morning are isolated. If pregnancy occurs, the genotype of the offspring
is analyzed by PCR amplification of tail DNA using primers to detect the
transgene. A lower percentage of pregnant females resulting from mating with
homozygous mutant males suggests male subfertility.
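
The comparison of pregnancy percentages can be made concrete with a small calculation. The text does not specify a statistical test; the sketch below assumes a one-sided Fisher exact test, a common choice for small counts, and uses hypothetical outcomes and function names.

```python
from math import comb


def pregnancy_rate(pregnant, mated):
    """Fraction of mated females that became pregnant."""
    return pregnant / mated


def fisher_lower_tail(mut_preg, mut_not, ctl_preg, ctl_not):
    """One-sided Fisher exact p-value.

    Probability, under no association between male genotype and pregnancy,
    of seeing `mut_preg` or fewer pregnancies in the mutant group given the
    observed row and column totals.
    """
    mutant_total = mut_preg + mut_not
    pregnant_total = mut_preg + ctl_preg
    n = mut_preg + mut_not + ctl_preg + ctl_not
    # Smallest pregnancy count the mutant group could show with these totals.
    lowest = max(0, mutant_total - (n - pregnant_total))
    favorable = sum(
        comb(pregnant_total, k) * comb(n - pregnant_total, mutant_total - k)
        for k in range(lowest, mut_preg + 1)
    )
    return favorable / comb(n, mutant_total)


# Hypothetical outcome: 1 of 5 females pregnant after mating with homozygous
# mutant males, 5 of 5 pregnant after mating with control males.
print(pregnancy_rate(1, 5), pregnancy_rate(5, 5))
print(fisher_lower_tail(1, 4, 5, 0))
```

With groups of five, only a large difference in pregnancy rate reaches conventional significance, so results of this assay are best read as suggestive, in keeping with the wording of the Example.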

Example 15

In vitro Fertility Assay
The method used is essentially that of Wolf and Inoue (1976). In brief,
male mice are killed, and each cauda epididymis is rapidly excised and minced
in 1 ml of Toyoda's medium pre-equilibrated at 37°C under 5% (vol/vol) carbon
dioxide in air. The minced tissue is left at 37°C for 30 minutes before the
tissue pieces are removed. An aliquot is taken for sperm counting, and the
incubation is continued for a further 30 minutes. Female mice are induced to
superovulate by injections of PMSG and hCG. The female mice are killed, and
their oviducts are removed and placed into Biggers, Whitten, and Whittingham
medium (BWW). Under a microscope, the oviduct is pricked, and the cumulus
mass is removed and treated with hyaluronidase. The denuded eggs are
washed through three changes of medium before being allotted to 100 μl
droplets of medium under silicon oil.

Approximately 10^ spermatozoa are added to each drop (i.e. 10^
sperm/ml), and the dishes are incubated at 37°C under 5% (vol/vol) carbon
dioxide in air for 5 hours. At this time, some eggs are removed and washed by
repeated micropipetting before the number of attached spermatozoa is scored.
The rest of the eggs are transferred to fresh medium and incubated for a
further 24 hours, at which time eggs are scored for evidence of fertilization and
development. Experiments are conducted with spermatozoa from a mutant
and a wild type male and eggs from a common pool of females.

Example 16

Regulation of EP17 Protein
The regulation of mEP17 protein expression was investigated using
Western blotting and immunohistochemistry in castrated mice, castrated
testosterone supplemented mice, unilateral castrated mice, unilateral
cryptorchid mice, and busulphan-treated mice. As previously observed at the
mRNA level (Figures 15A-15E), mEP17 protein disappeared from the initial
segment two days after bilateral castration and was not restored by
testosterone treatment. Similarly, after unilateral castration, mEP17 protein
disappeared from the castrated side, but not from the non-castrated side.
These data suggest that mEP17 is not regulated by circulating androgens, but
can be regulated by testicular factors provided via the efferent ducts.

To determine whether germ cell-associated testicular factors can regulate
mEP17, spermatogenesis was disrupted using cryptorchidism or busulphan
treatment. One month following cryptorchidism, mEP17 protein was not
detected in the initial segment of the cryptorchid epididymis but was detected
at normal levels in the scrotal epididymis. In cryptorchidism, the testis and the
epididymis are exposed to abdominal temperature. To distinguish the effects
of abdominal temperature on the testis or epididymis, spermatogenesis was
also disrupted by busulphan treatment. Following a 35-day treatment, the level
of mEP17 protein was drastically reduced when compared to untreated
controls. Collectively, these observations suggest that mEP17 is regulated by
germ cell-associated factors.

CLAIMS

What is claimed is:

1. An isolated promoter region, comprising an about 5.3 kb fragment
(Accession No. AF082221) of mouse genomic clone 10983 (Genomesystem
Inc., St. Louis, Kentucky) between the EcoRV and SalI restriction sites, or
functional portion thereof.

2. The isolated promoter region of claim 1, the functional portion
comprising a TATA box and at least one cis-acting regulatory sequence
selected from the group consisting of a Sp1 binding site, an AP-1 binding site,
a retinoic acid receptor binding site, an androgen receptor binding site, a C-Ets
binding site, a SRY binding site, an AP-4 binding site, a C/EBP binding site,
and combinations thereof.

3. The isolated promoter region of claim 1, comprising:
(a) the nucleotide sequence of SEQ ID NO:1; or
(b) a nucleic acid molecule substantially identical to SEQ ID
NO:1.

4. The isolated promoter region of claim 1, comprising a 20 base
pair nucleotide sequence identical to a contiguous 20 base pair nucleotide
portion of SEQ ID NO:1.

5. An isolated promoter region, or functional portion thereof,
comprising an about 5150 base pair region immediately upstream of the human
hEP17 transcription start site.

6. The isolated promoter region of claim 5, the functional portion
comprising a TATA box and at least one cis-acting regulatory sequence
selected from the group consisting of a Sp-1 binding site, an AP-1 binding site,
a cAMP response element binding protein (CREB) binding site, a SRY-related
HMG-box gene 5 (Sox5) binding site, a Sex-determining region Y gene product
(SRY) binding site, a c-Ets binding site, a GATA binding site, an Octamer
transcription factor 1 (Oct-1) binding site, and combinations thereof.

7. The isolated promoter region of claim 5, comprising:
(a) the nucleotide sequence of SEQ ID NO:5; or
(b) a nucleic acid molecule substantially identical to SEQ ID
NO:5.

8. The isolated promoter region of claim 5, comprising a 20 base
pair nucleotide sequence identical to a contiguous 20 base pair nucleotide
portion of SEQ ID NO:5.

9. A chimeric gene comprising the isolated promoter region of claim
1 or 5 operably linked to a heterologous nucleotide sequence.

10. A vector comprising the chimeric gene of claim 9.

11. A host cell comprising the chimeric gene of claim 9. 

12. The host cell of claim 11, wherein the cell is selected from the
group consisting of a bacterial cell, a hamster cell, a mouse cell, and a human
cell.

13. A mouse comprising a mouse cell of claim 12.

14. The mouse of claim 13, wherein expression of the chimeric gene
confers male infertility in an otherwise fertile mouse.

15. A method for identifying a substance that regulates EP17
expression, the method comprising:

(a) establishing a gene expression system comprising the
chimeric gene of claim 12, wherein the heterologous
nucleotide sequence is a reporter gene, and components
required for gene transcription and translation, whereby
the reporter gene is expressed, and a level of reporter
gene expression is assayable;

(b) assaying a baseline level of reporter gene expression
using the gene expression system of (a) in the absence of
a candidate substance;

(c) exposing the gene expression system of (a) to a plurality
of candidate substances;

(d) assaying a level of reporter gene expression using the
gene expression system of (a) in the presence of a
candidate substance of (c); and

(e) selecting a candidate substance whose presence results
in an altered level of reporter gene expression when
compared to the baseline level.

16. The method of claim 15, wherein the substance is a protein, a
chemical compound, or a peptide.

17. A method for identifying a substance that regulates EP17 
expression, the method comprising: 

(a) creating a transgenic mouse bearing a chimeric transgene 
comprising the promoter region of claim 1 or 5 operably 
linked to a reporter gene, wherein the reporter gene is 
expressed, and wherein a level of reporter gene 
expression is assayable; 

(b) assaying a baseline level of reporter gene expression in 
the transgenic mouse in the absence of a candidate 
substance; 

(c) administering the candidate substance to the transgenic 
mouse; 

(d) assaying a level of reporter gene expression in the 
transgenic mouse following administration of the 
candidate substance to the mouse; and 

(e) selecting a substance wherein the level of reporter gene 
expression in the transgenic mouse is altered following 
administration of the substance compared to the baseline 
level. 

18. A method for producing an epididymal cell line, the method 
comprising: 

(a) creating a transgenic animal bearing a chimeric transgene 
comprising the promoter region of claim 1 operably 
linked to a selectable marker gene that permits cell growth 
in the presence of a selective agent; 

(b) procuring epididymal cells from the transgenic animal of 
(a); and 

(c) reproducing the cells in vitro in the presence of the 
selective agent. 

19. A method for mutagenizing an EP17 locus in a vertebrate animal, 
the method comprising: 

(a) constructing a targeting vector having the isolated 
promoter region of claim 1, a marker gene, and an 
isolated 3' flanking region of an EP17 gene, wherein the 
marker gene is positioned between the promoter region 
and the 3' flanking region; 

(b) linearizing the targeting vector by digestion with a 
restriction endonuclease, wherein the promoter region, 
marker gene, and 3' flanking region are undigested; 

(c) introducing the linearized vector into embryonic stem 
cells; 

(d) detecting the marker gene in the embryonic stem cells; 

(e) selecting embryonic stem cells having the marker gene; 
and 

(f) generating a transgenic vertebrate animal derived from 
the selected embryonic stem cells, wherein the EP17 
locus of the animal is altered as a result of a homologous 
recombination event mediated by the targeting vector. 

20. An isolated EP17 polypeptide, or functional portion thereof, 
comprising: 

(a) a polypeptide encoded by the nucleotide sequence of 
SEQ ID NO:3; 

(b) a polypeptide encoded by a nucleic acid molecule that is 
substantially identical to SEQ ID NO:3; 

(c) a polypeptide having the amino acid sequence of SEQ ID 
NO:4; 

(d) a polypeptide that is a biological equivalent of the 
polypeptide of SEQ ID NO:4; or 

(e) a polypeptide which is immunologically cross-reactive with 
an antibody that shows specific binding with a polypeptide 
of SEQ ID NO:4. 

21. An isolated nucleic acid molecule encoding a human EP17 
polypeptide. 

22. The isolated nucleic acid molecule of claim 21, comprising: 
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(a) the nucleotide sequence of SEQ ID NO:3; or 

(b) a nucleic acid molecule substantially identical to SEQ ID 
NO:3. 

23. The isolated nucleic acid molecule of claim 21, comprising a 20 
nucleotide sequence that is identical to a contiguous 20 nucleotide sequence 
of SEQ ID NO:3. 

24. A chimeric gene, comprising the nucleic acid molecule of claim 
21 operably linked to a heterologous promoter. 

25. A vector comprising the chimeric gene of claim 24. 

26. A host cell comprising the chimeric gene of claim 24. 

27. The host cell of claim 26, wherein the cell is selected from the 
group consisting of a bacterial cell, a hamster cell, a mouse cell, and a human 
cell. 

28. A method of detecting a nucleic acid molecule that encodes an 
EP17 polypeptide, the method comprising: 

(a) procuring a biological sample having nucleic acid material; 

(b) hybridizing the nucleic acid molecule of SEQ ID NO:1, 2, 
3, or 5 under stringent hybridization conditions to the 
biological sample of (a), thereby forming a duplex 
structure between the nucleic acid of SEQ ID NO:1, 2, 3, 
or 5 and a nucleic acid within the biological sample; and 

(c) detecting the duplex structure of (b), whereby an EP17 
nucleic acid molecule is detected. 

29. An antibody that specifically recognizes an EP17 polypeptide 
encoded by the nucleotide sequence of SEQ ID NO:3; a polypeptide encoded 
by a nucleic acid molecule that is substantially identical to SEQ ID NO:3; a 
polypeptide having the amino acid sequence of SEQ ID NO:4; a polypeptide 
that is a biological equivalent of the polypeptide of SEQ ID NO:4; or a 
polypeptide which is immunologically cross-reactive with an antibody that 
shows specific binding with a polypeptide of SEQ ID NO:4. 

30. A method for producing an antibody that specifically recognizes 
an EP17 polypeptide, the method comprising: 

(a) recombinantly or synthetically producing an EP17 
polypeptide of claim 20, or portion thereof [illegible] 

[illegible] an immune response in the animal [illegible] 

[illegible] in the blood serum of the animal; and 

[illegible] the blood serum from the animal of (c) [illegible] 

comprising [illegible] the method [illegible] 

(a) [illegible] a biological sample having [illegible] 

[illegible] (a) by immunochemical reaction with the antibody of claim 
[illegible] 

[illegible] SEQ ID NO:4; a polypeptide that is a biological 
equivalent of the polypeptide of SEQ ID NO:4; or [illegible] 

[illegible] to a plurality of [illegible] 




(d) selecting a substance that demonstrates specific binding 
to the isolated EP17 polypeptide. 

33. A method for modulating EP17 function in a subject, the method 
comprising: 

(a) preparing a pharmaceutical composition, comprising a 
substance identified according to the method of claim 15, 
17, 30, or 32, and a carrier; 

(b) administering an effective dose of the pharmaceutical 
composition to a subject, whereby EP17 activity is altered 
in the subject. 

34. A method for modulating EP17 function in a subject, the method 
comprising: 

(a) preparing a gene therapy vector having a nucleotide 
sequence encoding an EP17 polypeptide or a nucleotide 
sequence encoding a nucleic acid molecule, peptide, or 
protein that interacts with an EP17 nucleic acid or 
polypeptide; and 

(b) administering the gene therapy vector to a subject, 
whereby the function of EP17 in the subject is modulated. 

35. The method of claim 34 further comprising the EP17 promoter 
region of claim 1 or 5. 

36. A method for diminishing the fertile capacity of a subject, the 
method comprising: 

(a) identifying a chemical compound, peptide, or antibody 
that interacts with the polypeptide of SEQ ID NO:4 or 6; 

(b) preparing a pharmaceutical composition comprising the 
chemical compound, peptide, or antibody of (a) and a 
carrier; and 

(c) administering an effective dose of the pharmaceutical 
composition to a subject, whereby the fertile capacity of 
the subject is diminished. 

37. A method for promoting fertility in a subject, the method 
comprising: 

(a) identifying a chemical compound or peptide that 
interacts with the polypeptide of SEQ ID NO:4 or 6; 

(b) preparing a pharmaceutical composition comprising the 
chemical compound or peptide of (a) and a carrier; and 

(c) administering the pharmaceutical composition to a 
subject, whereby the fertility of the subject is improved. 
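
Several of the claims above recite a 20 nucleotide (or 20 base pair) sequence identical to a contiguous 20 nucleotide portion of a reference sequence. The following is only an illustrative sketch of how such a comparison might be checked computationally; it is not part of the claimed subject matter. The function name `shares_contiguous_run`, the toy sequences, and the choice to compare a single strand only (no reverse complement) are all assumptions made for illustration.

```python
def shares_contiguous_run(candidate: str, reference: str, k: int = 20) -> bool:
    """Return True if `candidate` contains a k-nucleotide window that is
    identical to some contiguous k-nucleotide window of `reference`.

    Assumption: both inputs are plain nucleotide strings on the same strand;
    the comparison is case-insensitive and ignores the reverse complement.
    """
    candidate = candidate.lower()
    reference = reference.lower()
    if len(candidate) < k or len(reference) < k:
        return False
    # Collect every contiguous k-nucleotide window of the reference once.
    reference_windows = {
        reference[i:i + k] for i in range(len(reference) - k + 1)
    }
    # Slide a k-nucleotide window along the candidate and test membership.
    return any(
        candidate[i:i + k] in reference_windows
        for i in range(len(candidate) - k + 1)
    )


if __name__ == "__main__":
    # Hypothetical toy sequences, not taken from the sequence listing.
    toy_reference = "acgtacgtaccggttaaccggttagcatgcatgcaatt"
    toy_candidate = "tttt" + toy_reference[5:25] + "gggg"
    print(shares_contiguous_run(toy_candidate, toy_reference))
```

A set of reference windows is used so that each candidate window is tested by a single lookup; for a reference of a few thousand nucleotides this stays small.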
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Figure 2 
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Figure 3 
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Figure 5 
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Figure 6 



Sequences producing significant alignments: 

sp|P06911|ERBP_RAT EPIDIDYMAL-RETINOIC ACID BINDING PROTEIN... 
gb|AAA41127.1| (M12790) secretory protein B prepeptide [Rat... 
pdb|1EPA|B Chain B, Epididymal Retinoic Acid-Binding Protei... 
gb|AAD09351.1| (U68381) epididymal retinoic acid binding pr... 
ref|NP_031973.1| epididymal retinoic acid-binding protein; ... 
sp|O02853|PGHD_BOVIN PROSTAGLANDIN-H2 D-ISOMERASE PRECURSOR... 
gb|AAG30028.1|AF281353_1 (AF281353) prostaglandine D syntha... 
dbj|BAA12075.1| (D83712) Prostaglandin D Synthase [Xenopus ... 
ref|NP_000945.1| prostaglandin D2 synthase (21kD, brain) [H... 
pir||S52354 gene cpl-1 protein - African clawed frog >gi|66... 
gb|AAA36494.1| (M61900) prostaglandin D synthase [Homo sapi... 
dbj|BAA86198.1| (AB032480) prostaglandin D synthase [Macaca... 
sp|P35578|ESP4_LACVI EPIDIDYMAL SECRETORY PROTEIN IV PRECUR... 
sp|Q9XSM0|PGHD_SHEEP PROSTAGLANDIN-H2 D-ISOMERASE PRECURSOR... 
sp|Q29487|PGHD_FELCA PROSTAGLANDIN-H2 D-ISOMERASE PRECURSOR... 
gb|AAB27607.1| beta-trace protein, prostaglandin D synthase... 
prf||1917182A androgen-dependent epididymal protein [Lacert... 
dbj|BAA94343.1| (AB04099X) prostaglandin D synthase [Orycto... 
sp|Q01584|LIPO_BUFMA LIPOCALIN PRECURSOR >gi|104284|pir||S2... 
pir||A45542 beta-lactoglobulin I - pig >gi|406773|emb|CAA38... 
ref|NP_037147.1| prostaglandin D synthase [Rattus norvegicu... 
sp|O09114|PGHD_MOUSE PROSTAGLANDIN-H2 D-ISOMERASE PRECURSOR... 
gb|AAC62487.1| (AF093791) androgen-regulated epididymal sec... 
gb|AAA53371.1| (M25784) quiescence-specific protein [Gallus... 
sp|Q9XS65|PGHD_CANFA PROSTAGLANDIN-H2 D-ISOMERASE PRECURSOR... 
gb|AAF35894.1|AF229030_1 (AF229030) lipocalin Q83 [Coturnix... 
sp|P04939|MUPM_MOUSE MINOR MAJOR URINARY PROTEIN 15 PRECURS... 
sp|P07380|LACA_HORSE BETA-LACTOGLOBULIN II PRECURSOR, MINOR... 
sp|Q29095|PGHD_PIG PROSTAGLANDIN-H2 D-ISOMERASE PRECURSOR (... 
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Figure 7 



MEP17 and sequence from human chromosome 9q 

[Amino acid alignment of MEP17 with hMEP17, annotated with exon 3 and exon 4 boundaries; the residue rows are not legible in the source scan.] 

Identity = 61 % 
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Figure 8 
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Figure 9 



Restriction enzyme map of the E57, pE12, pHindIII and pXbaI plasmids 

Structure of MEP17 gene and mE-RABP gene 
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Figure 10 



C T A G Ep tRNA 
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Figure 11 



Nucleotide sequence of the MEP17 5.3 kb promoter fragment 

[Sequence panel not legible in the source scan. The labels that survive mark Oct-1 binding sites and a TATA box, with position coordinates in the right margin.] 

Figure 12 

Chimeric constructs of MEP17 promoter linked to the CAT reporter 
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Figure 14 
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. SEQUENCE LISTING 

<110> Orgebin-Crist, Marie-Claire 

Lareyre, Jean-Jacques 

Matusik, Robert 

<120> EPIDIDYMAL LIPOCALIN GENE AND USES THEREOF 

<130> ATTY DOCKET 1242/40 

<160> 11 

<170> PatentIn version 3.0 

<210> 1 

<211> 5315 

<212> DNA 

<213> Mus musculus 

<220> 

<221> 5'UTR 

<222> (1)..(5315) 

<400> 1 



gatatctagc atgtttccag tttctggata ttttgaataa agctactatg aacacagttg 60 

agccagtgtc cttgtgggat ggtggagtat cttttgggta tatgcccagg agtgatatag 120 

cagggtcttg gggtagaact atttccagtt tattgagaaa ctgccacatt gatttttcag 180 

^9tggttgta ccagttttta ctcccaccag caatggagga gtgttecctt gtgttccata 240 

ccctgtcact tgaggtgttt ttttgttttt gttttaagat tttttttttc gagacagggt 300 

ttctctgtat agtcctgtct gtcctggaac tcactttgta gactaggctg gcctagaact 360 

cagaaatccg cctgcctctg cctcccaagt [illegible] [illegible] ^ccLccacacc 420 

cggctgtttt aagatttttt taaaaaaaga tttatttata ataataaata tctaagtaca 480 

ctgtagctgt cttcagacac accagaagag ggcatcagat ctcattacag atggttgtga 540 

gccaccatgt ggttgctggg atttgaactc «^99atctttg gaagagcagt cagtgctctt 600 

aaccactgag ccatctctac agcccatcac ttgagttttt gaabcttatc cattctgaca 660 

gttgtaagat ggaatctcag agtccttttg atttgcatct ccctgataac taaggacgtt 720 

gaacatttct ttaagggctt ctcagccatt cgagattcct gtttaactag gaaactcttt 780 

actaactcct ttcttgcctg tcacagccac tgtgaagggt tcacacataa tcgcttctct 840 

gcatcgcgta ggtcctaaga ccccagtccc caggccctca aggcGttttc agacfctctac 900 

ccaacaatgg gtcttctggg taacatgatrg gtaacaccac ccaaatcagt tactagttga 960 

gctagcccac cagagcacct gcctaggtca tcccttcagt tctggagcat gtaggtctac 1020 

attgcctaga cccccttcta tctccagcac cggtgaggag cccctgatgg ggataattct 1080 

ctaaccagct ctgtggggat tgtaggggtg agcaggaccc aagcacctag tgggtgtgtc 1140 


tgagagaaag gaggaaaatc aatgatcttc ccatgtgcgt ttctctgggt aaggaggtac 1200 

tcaggtgtcc tgagtgaccai gtgccaacat tctcgggeta tgtgcagggt gagttttctt 1260 

ctgagcttgg gtcagcaaga tcctatccct gaaatctgtg atccctacta cctgtcgagt 1320 

ctcatgctta tggttttagc ttagacatgt ctggtagaag atgccaccct ggtgtcttgg 1380 

ggccatcata ataagccaag gcaaagagac agaaagaatt ctcttttagt ccaagagacc 1440 

gaagtcagga atccatgtgt gtggaggcta gaggagggct cttgccaatc tgcaccccca 1500 

acccccagct tctggtcttt cttctgttct gggcagtctt aatcttgact gcatcactaa 1560 

tatctgcctc tttcttctgc ccatctgtgt ctgaattctt ccattcataa ggatgctgtc 1620 

tttgggttag ggccatctga acctgtctgc agaaaccttg tcttcaagta agatattgtt 1680 

tggaggttct gggtggggga tactattcaa tccagtgtac cccatgtttt gctaaccttt 1740 

gggcccccct tctcagggct tgtaccatat ctttaattgt atcatcctgt tcccaagaac 1800 

cctgaaaggg cagaactatt tttcatctac tgctccacta cagggttagg gtagaagtcc 1860 

ttggcatctt gatacaggca ggttcatggg gaggcctacc caaccaaacc aatgtagaaa 1920 

gctgcccaat tcctacccaa gctcattgct aacaattttg gcactcacaa agccacagta 1980 

gtagatgaca tgtgagtgta taaaagtcag tggatagagg gggtgggtgt aataccgtat 2040 

ggcttatcct gggatggaga ctttgtggta tttcttgttc ttttcccagc ctcctagttc 2100 

tctgcatgct ggtactttgc aggtcatgat aggcctctta ttatcctttc ctcttgcctg 2160 

cttaccctga atgectgctg ttttgacagt ccatggatct aaacacctgt gctgagctta 2220 

catgttctcc actatcaggt agttttgaaa acgtaagaag ttagacteaa tgtcaaatca 2280 

ccctgtcatt tttcctgtta ctatggtctt ggtctctctt ctcagggcct ggcaaggctt 2340 

aggggttact agtcctgtaa gaattaggga tccagaagta tggcaggtct ctatggtctc 2400 

tgggaaggtg ccttggagag gcatctctag tcgggtcagt ggcagtgtgg gaagttttta 2460 

ctggtttata gatgctttgg tgacaggatt atctttcctt gcctttgtgg gttggcagcc 2520 

ttggggggca gtgcctagct tgtggggagt atggaggatg tttgacaaag gactggtgat 2580 

ggagtgattg tgccctttta gggtaggatg aacctggagt tgccaacctc aagagtgtca 2640 

ctctcactgt gatctttagg gactgagaga tcctcttggg tcceacctcc ttccgtcctg 2700 

tgttccttct cttgcfecttc tcctttcacc caataaacag attcagcatc cgcatggctc 2760 

tttgaccctt tggagatgcc cttgggcaga tcatagtgaa ctctttcttc atcccagccc 2820 

tctggccact tagggttctc ctcgagagtg ggcaggatgt ttcattcaaa tttggtgaag 2880 

ggaggttgct gcatccacag ggaagactga gcacagcata tgtggtatcc acgaggtcac 2940 



agtgtctttg aaaccaaggt gcctactgag gatgggttgt aggctctagg ctcccttggg 3000 

tctcagaaat tgtatgtatg gatatatgaa ctggaaagca ggagggccat aatgctaggt 3060 

tttgattccc atctcctcct ttgatcttgg acaaatggtt tcgacctttg ggagcctaag 3120 

aaataggtca gggcagtggt cactgtgctt gaagaccttg ctaggctagc tgtcatgtga 3180 

tttccatgac agacctcctg gatacacata gctagctgca tccattcttt tttctcagat 3240 

tgagacactg aggcacagaa gctggacagg ttactcaagg tcacacgatg gacaaatttca 3300 

ggggtggcat gatgtagtct ctgtgctgct cttgaacaag atgtggggta cgggggttcc 3360 

tttccaggta ttatcactgc tctcgtcttc atgtcacaga attcccattc tccagtggat 3420 

ctagttctac ttcaaagtcc aggcatggaa acaggaaggg gctggttctg ctccattgct 3480 

tatctgtctg tggtcctcag acaacaagaa ctggaagcaa ttagccctca tagctgctgc 3540 

cctcatcctg tgccagtcac ccagagaacc aggtgccatc cacccacatc tctttccctc 3600 

actccagcca aacccctctc tgtatcccca agggccctgg cttgtcagtc tgttgttggg 3660 

accagcaggc cctcagcttc cctcccgatg ccccaaagtc tttggatggt tccttctagt 3720 

gctcactgtg cattctctcc taactcatct cttatgccgt cctcacaggc caccctaatg 3780 

tcttctgtgc ctggttccct ctggagggtg ggcttcagtg gatttcctgt gagctgtcca 3840 

ggtcttgagc agctccaggt gggtccaaaa gattagactg ttcctacagg atgaggggtg 3900 

ctgacctggg tagagctgtt ttggggtgcc ctgggacact caagacctat ttctctgcta 3960 

cacaagttgg aggttagaga tttgggtcac aaggtgacta agtcattttg ggggtgtcag 4020 

tcagggaggg ctttgcattc agggtgcttc atggtccctg tttatcctgt ttctttcctt 4080 

tttatttgga ggaaatagac aattgtgaga aaaactaata tttttagttt ttttattttc 4140 

ctagcgaatg ggagactttg ttttagtatc aatcacaaat aaagttgatt ttaaagataa 4200 

aagttgtctg ccttaaaaaa attctctgtg gtctgagagg tgtttgtttg tttctgagag 4260 

gtgtgaaaaa ccttgtcgtt ggggctggga agatagctca gctggtaaag tactttcttc 4320 

agaacccata taaaagccag gcatggtggt gtgtatttga aatcacatgg tgagtgctgg 4380 

acagacagag atagtgggat ctctggggct cactggctag gcagcctaga ctactcagca 4440 

cgcttcaggt tiagtaagaaa ctctgtctca aaaaataagt aaccagccaa ttgaccaacc 4500 

aatcaataaa ctaactaact agctagctag ctagctagct aactaactaa ctaactaact 4560 

aactaactaa tgaagcaaaa caaatgaaaa cacaaagtga cttgtactaa aggaacaaca 4620 

cctgaggttg aactctggct tccacatcca tgtgactatt tgtatgcttt ccccaaccaa 4680 

gaacaaagac atgaacacta tatacacaga gacacacaga cacacaaata ccacaaacat 4740 

acacaccaca cacaaacaicc acatacaaac atacatacca cacacaaata ccacaaacat 4800 

atataacaca cacatacaca gataccacat acaaacatat ataacacacc acacagatag 4860 

agatactatia taaaaaaaba tacabacaca gaatggaggg aatgcagaat gctgcaaggt 4920 

atiaaggtatia aggttcctgg ttggagactc aggcctcctt gttcagttca atattttgtt 4980 

attgttttaa gcttattgtt acaaatggag aaactgagat aaatagcttg tccagttcat 5040 

agctcagagc gtggggctgg acttggttgg tgcagtctta ctgcatagtt catcactgtc 5100 

cacatgctag gatggaggca gcttaactgt catcttagct tggtcctaca cctctctgga 5160 

tgggggttga tagcattitga gcagaagctg agtctctgag cagctgacag ccagctttgt 5220 

ccaatgacat tctctaagtg gttgcacatg cttgcacact ctccaaatat aagctcccac 5280 

cttgcataaa cagaagccac aagccaggcc ctgag 5315 


<210> 2 

<211> 30865 

<212> DNA 

<213> Homo sapiens 

<400> 2 










tgtttatttt tattattttt tttttttaaa tttatttaac ataattttta ttaataataa 60 

taataaatat agtttgtaaa ttttaaaaat aaaagagaag agataaagaa tagattaaat 120 

gaattatata aatgaattaa ggtaatactc aatcctttgg atatgtggga ttatacacca 180 

tataattttg ttatalittaa tattagtata tatggtaggg tgtatgtgag aaaaggtaaa 240 

acttaaaaaa gcggtgaatt acaagatcgg taacgaaggt cgaaaagaaa ggaaatagcg 300 

ggtgtctggt gatgtcttgg atttgagggt cgggggtttc gaactctttt taaggtccat 360 

tatttaggcg cggcggggag agaggtatgt aaaaccgcaa gatggggtcc ttatcaiggat 420 

aaggaccacg agtgactgtt agcagaaagt aatatcatga ggaaacagta aaaaaaatac 480 

ttatgagttg ggttgggagg ggactgagct ctgagaaagg accccagttc actatgtcaa 540 

aaacgaaaaa aaggcaaggc gcagtggctc acgcctgtaa tcccggcact ttgggaggcc 600 

aaggagggca gatcacctga ggtcaggagt ttgagaccag cctgaccaac atggtgaaat 660 

cctgtictcta ctaaaaatgc aaaaaaaatt agacgggcat ggtggctact gttatcccag 720 

ctacatggga ggctgaggca ggagaatcgc ttgaacccgg gaggtggaga ttgcagtgag 780 

ctgagattgc accactgcac tccagcctga gtgacagagt gagaccccgt cttaaaaaac 840 

aaaacaaggc caggcacggt ggctcatgcc tgcaatccca gcactctggg aggctgaggc 900 
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aoQ t QQa tea 


ca'actQt:ca.cici 


agait cgagac 


catcctcicrct 


• aacatocftcia 

ddw^**» JJ^^ 


aaccc tat ct 


■ 960 


ctactaaaaa 


tacaaagaat 


tatcccagca 


^yy '^yy *-yy 


caoctataat 
v^y \^Vtf> *»y ^yy ^ 


cccaactact" • 


1020 


CCQaaQQCtQ 


aQQ t QQcraQa 


atQQccitcraa 




aaacic fc tore a 


ot aaac t aaa 

y wydy Vi» wydy 


1080 


attgcgccac. 


agcacticcag 


cctQCfcrcQac 




■ tccatctcaa 


aaaaaciaaaa 

WCd ^Ad d d 


■ 1X40- 


aaaaaaaaaa 


aaaa'aaaaga 


aaaaaaagaa 


aaaaoaaaaa 


aaaaaaaaaa 


aacaaacraaa 


1200 


aaccacaaatt 


aaataaaagt 


tQaocit a t oa 


tgtatgcaca 


-actacattca 


-ceectttaaa 


1260 


QQcacaattQ 


tgtgagttta 


gacaaatgca 


aatgttcatg 


a^A(ic^ciiL.ac 


aaatacaaca 


1320 


cagcagctct 


QCftiQcccaca 


aagcttcctc 


aaaatccctt 


acaatcaatc 


a ctar!r!f^cir"a 

dw w v^v^y 


1380 


cctcagccct 


cagagccact 


crcrciaaocc: c c 


ctaccctatci 


aactcaccta 


tectaaaaca 


1440 


t: eg fa t a aaa 


oaacrticacaa 


atcaeacrcct 


t ttcnafr't'CT 


a c t" t" c ^ CT t"- r* c 


w ^y vdy y ^ 


1500 


t oc a c c t oaa 
^ ^ *^ *— ^ *— 3J 


at t facfccaf 


f t" f^Ot" tT"f* t~ 

Vo V> Vr^J w W 


t" t~ t~ 5* f* t~ n/^ fr* rt 


cty L«y w^Vtf'Ci 


d^y L>y 0 i^y 0 u 




catQcacfaacj 

^ Z3 t-A CLV^ 


t" cic ir* ^5 a CI t 

^ w v.* d d. W w 


r'cifr*'t~fTf~ ^ a 0 


^ L. ct^ ^ i« 1^ ct 




gcgydydccu 






a t aacT 1 1 cf CI t" 


■Cfciaa t'cit' i~ i" r" 




a.y i«^v* ^y t-V^'d.y 


^ddy uc ccyc 


J.OO u 


aot crt* acr t c: c 


catcttattc 


ttccfcor'acfa 

1^ tjw^j w^HxrfCt^H CI. 


yy y *-y « i-«y v,- 


u-y L.y dw* u> 


\^ \^ dy w \^\mf 




acaciocticta 


acicra t taaac 




fift 5* 5* r»rT r« 0 1" t~ 
y y dciv^y \^\^ l l. 


v. v»d t-y y L«w> L«y 


^y dyy i»yi,«d^ 


A. 0 \J \J 




caaaaagct't 


ccaaattaca 




ydwi^y d^y 


cracTfCf t t f 
y dy v*y ^y il. li- ^ 


1860 


tcatc tccca 


tccggatgtt 


ttctttaatci 


aaatatT't* t" t 


i~ r* a aa t~ c t* c t" 


t OTT! 1 1 1 1 ta 


192 0 


tttttatttg 


ti t ti a t a ti a ti a 


tattttgaga 


cacraatttta 


ct ctttcact 


taaattaaaa 


1980 


tQcaacqaca 


cerate ttaac 


tcactgcaac 


ctccccccac 


ccacctcact 


ttaaatteaa 


2 040 


gcgattctcc 


tgcctcagcc 


-tcccga^tag 


ctaaaactac 
v» ^^y yy i*dw 


aaacacccac 




2100 


ggctaatttt 


tgtattttga 


gaagagatag ' 


aattt caeca. 


tattaaccaa 


'actaatct ca 
y ^yy *- i^wy 


2160 


aactcctgac 


c t c acrci t cia t 


ccsiCccciccitL 


aaacc^dcciA 

y v., ^ w ^ 


aaafeacaaaa 

d d d. w CA.y y 


cL ^ wd%>» ^^rjy ^y 


2220 


tQaaatacca 


caci^tggcct 


cttfcttattt 


i" t"aat*craaat*- 

i« L> dci v^y y y 


aacttatttt 
y y V wy v»\» 


fttat t aftf 


2280 


agt c t tggga 


crttcctcata- 


aa tcictcrcrat" ' 


1 1" r»'aat" fa 


t f t c'a ad ^ crt" 


a t CTca 1 1- 1 1 a 
ai»y wy ^ u> L> i-y 


2340. 


tagatgtttt ctgtagggac cagccccaca gggttggtga gtttctcdct gtgtgctgag 2400 

atgagagggc atagaaataa ggacacaaga aaaagadata aiaagaaaaga cagctgggdc 2460 

tggggaccac taccaccaag acgtggagac cggtagtggc cccgaatgtc tggctgtgct 2520 

gatatttatt ggatacaaag caaaagggac [illegible] gtgtgagtca tctccaatga 2580 

taggtaaggt gacgtgagtc acgtgtccac cggatggggg gcccttccct gtttggcagc 2640 




caaggcggag agagagagag agagacagct tacatcatta tttctgcata tcagagactt 2700 

ttagtacttt cactaattga ctactgctat ctagaaggca gagccaggtg tacaggatjgg 2760 

aiacatgaagg cggactacga gcgtgaccac tgaagcacag catcacaggg agacggttag 2820 

gcctctggat aactgcgggc aggtctgact gatgtcaggc cctccacagg aggtggagga 2880 

gtagagtcct ctctaagctc ccccggggga aagggagacb ccctttccct gtctgctaag 2940 

tagcaggtgt tttcccttga cactgacgct actgctagat cacggtccgc ttggcaaccg 3000 

gtgtcttccc agacgctggc gtcaccacta gaccaaggag ccctctggtg gccctgtccg 3060 

gcgtaacaga aggctcgcac tcttgtcttc tggtcacttc tcactatgtc ccctcagctc 3120 

ctatctctgt atggcctggt ttttcctagg ttataattgt agagcaagga ttattataat 3180 

attggaataa agagtaattg ctacaagcta atgattaata atattcatat ataatcatgt 3240 

ctatgatcta gatctagtat aactcttgtt attttatata ttttattaaa ctggaacagc 3300 

tcgtgccctc ggtctcttgc ttcggcatct gggtggcttg ctgaccacag ttttcctgct 3360 

gtatctaact gctggcttag gtgaacctgt caggggcgtg gctgggaact ggaacctcct 3420 

gaagaccagt ggaggccaca agcagaagcc ctcggctcct tcccttcccc gatgctgggg 3480 

gccaagggtc tttctctgac cctgtccccc cctcacgtta ccagtggagg gtgtccaggt 3540 

tcttggcatt ttgaacaaag aattggacaa aacacacaaa caaagcaagg aaagaatgaa 3600 

gcaacaaaag cagaaattta ttgaaaatga aagcactctt tacagggtgg gagcgggctg 3660 

agcaagcagc tcaagggccc cggttacaga attttctggt gtttcaatat cctctagcgg 3720 

tttaccattg gttacttggt gtatgcccta tgtaaatgaa gaggatgaag tcaaagtcat 3780 

tttctcggct gagcatactg tcacgcctgt aajtcccagta ctttgggagg ctgaggtggg 3840 

tgtatcacct gaggtcggga gttcaagacc agcctgatga acatggagaa accccgtctc 3900 

tactaaaaat acaaaattag ctgggtgtgg tggcgggtgc ctgtaatccc agatagtcag 3960 

gaggctgagg caggaaaatc gcttaaaccc gggaggcaga ggctgcagag agccaaggag 4020 

tgcaccactg cactce^gcc tgggcaacaa gagcaaaact ccatctcaaa aaacaaacaa 4080 

acaaaacgaa gtcatgtact cggtatgagc cdtacgtaaa cggagaggat ggagtgaagg 4140 

tacaaagcca ttcacattcc cgtcatcgtt agtgtttcca gttgatttgg ttctaggatg 4200

cccttgggct ccetgcgtcc aggccctctt ctcctgcctc actctcactc cagaggactg 4260

gcagtgctgg cctccctggc aggactcccg cccctgacca gaggggtctg ggcgtgctcc 4320

gaggccacct gctgctcctt gcccagtgtc cctgggtgct ctgtggggac acagagaggt 4380 
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cacttgggag gttcctgcct ctgccagact gcggcgcctt cctggcgtca ttgccccctg 4440 

cacgcttata caaccagtga taaggcggta tccaatgccg taatcctccc cagggccagc 4500 

ccagccctgt ccggtcccgt ccggtccctg gcctgcttgc tgggagccca gctgggggct 4560 

gcatcggggg tggtgggggt ctggagtgag gggtgcccag gcctctctca cagtgtgggg 4620 

gcacaagctc tgccctggac actctgccag tgtgacccta ccccccatgt tigggggtigag 4680 

agaagggggc tttctgggtg gccagatccc aggagagggg gcctagccag accaacccca 4740 

gcccagctct ctgcagctgt accccaaagg ccctggccca gcccagecag acagagtgtg 4800 

gacagagggg cttgggtgcg agtccaggca aggcaggaca agtctctgct gtggcttggc 4860 

agtggccccg tgactggtga catagtctgg caggcctagg ggacaaatcc tcagcccggg 4920

tgcatcaggc cagagcctgg ctgtctggag ccctccagga aatgaccccc cgggctgggg 4980 

gacatccgag tgattgtgcc aaacaatgga caagggggcc ggagtctcag agccgaagtg 5040 

ccttccgctt ccatctgctc cgcccaggag caggtgcacc caggtggtgg cctgggctat 5100 

aaagctggcc ccctggggct tggggactca gcaccagggg ctggagggca ggggagggga 5160 

tgatgteatt cctgctcggc gcaatcctga ccctgctctg ggcgcccacg gctcaggctg 5220 

aggttctgct gcagcctgac ttcaatgctg aaaaggtacc aggggcctct gctgtcctgt 5280 

gOtgggtggg agctgggccc ctgccagaga caacgtgata attgtgacaa ctgcctttcc 5340 

tggggcgctt gctctgtgcc cggccaggca cgtgctatag acccactctg cgcttaatcc 5400 

taaaccaacc tgtgaatgga tcattactgc acccacttca cagctgggaa ctgaggctca 5460 

cagttgcact gtgacctacc caagatgagt tcctgtccac ctgggctcag ctcacaccca 5520 

agagatggcc actccgagac cccttgctgt gtgacttctg agttgtcccc tgggtccctg 5580 

ggcaaggagg gccctgctgc tggctgtcct ggcctgcggg tgactaagcc cccggcagtt 5640 

ctcaggcctc tggtacgtgg tctccatggc atctgactgc agggtcttcc tgggcaagaa 5700

ggaccacctg tccatgtcca ccagggccat caggcccaca gaggagggcg gcctccacgt 5760 

ccacatggag ttcccggggt gagttacttg ggctgggctg cctgggctgg gctggggagc 5820

tgagigrtagct; gggtgccatg agtgagccgg ctgtgcaccc ccagggcgga cggctgtaac 5880 

caggtggatg cpgagtacct gaaggtgggc tccgagggac acttcagagt cccgggtagg 5940

tcctgcctcc tctgccttca ggggtgctgg atgccgcttc tctgggggtg acgtggggct 6000

gactttccac ccgggcccaa gctgtccatc ccagtctcca gqcccctgac cgccctcccc 6060 

gcccctcctc tgtettcggg gaacaggctc aeagcccagc agcaggggca ggcatgggtg 6120 




gggtcggtgc 


ccgggcccat 


ccacttctcc 


agcbtgaaaa ggccgcctgc ctctccgcag 


6180 


ccttgggcta 


cctggacgtg 


cgcatcgtgg 


acacagacta 


cagctccttG gccgtccttt 


6240 


acatctacaa 


ggagctggag 


ggggcgctca 


gcaccatggt 


gcagctctac 


agtgagtgcc 


6300 


tgtcccacct 


ccccgcactg


gcccctcccc 


aacttcccac 


aggccccgcc cccGgttctg 


6360 


cctacccccg 


ccccgcccgc 


ctgccgcccc 


gcgcctcctc 


acaggccccg 


ccctcacccc 


6420 


tccgcccccc 


gccttttagg 


ccgagtttag 


gctcegcctc 


ctcatccccg ggcccgcccc 


6480 


ttctggtgct 


gccccatgcc 


ctctcctgct 


gtgacccttc 


gcacacactc ctggcctggc 


6540 


caggaccggg 


tggggtggga 


gctgggcccg 


ccgtgctttg 


gatgtggagg 


aggttggctt 


6600 


gggggagggt 


cccgcggtgc 


atgfctcctct 


ctggcctcct 


gagagtggtg ccccgccctt 


6660 


ccctgcctgg 


cctcgctgcc 


tgagttctgg 


ggctggggct 


gaggttcttt 


tttttttttt 


6720 


tttttttaat 


atggagtctc 


actctgtcgc 


ccaggctgga 


gtgcagtggc 


gcgatctcgg 


6780 


ctcactgcaa 


cctccgactc 


cccagttcaa 


gcaattctcc 


tgcctcagcc 


ccccgataag 


6840 


ctgggactac 


aggcgcccat 


caccatgccc 


gtatttgtat 


ttgtaaaaat 


ataaataatt 


6900 


ttttgtattt 


ttagtragaga 


cggggtttca 


ccgtgttagc 


caggatggtc 


tcgatctcct 


6960 


gacctcgtga 


tccgtccgcc 


tcggcctccc 


caaagtgctg 


ggattccagg 


cgtgagccac 


7020 


cgcgccggct 


ggggctcagg 


ttctttaact 


cctggggact 


ccccgattgg 


agccgacaga 


7080 


gtgaggctgg 


aggcttccgg 


gctgcaggga 


aagaggctca 


aggggtagtg 


gggcccagga 


7140 


ggacagccag 


accaccccaa 


cctcccagac 


atcttggtgt 


gaattgggga 


agggggcccc 


7200 


aagtttcctt 


cagccaggca 


ggagctttat 


gccgcgcaca 


acctgcccaa 


cccagggtgg 


7260 


cggccttgct 


gtcagccctc 


aggctggtgg 


agtgaggcca 


ggagaccccg 


tgctgcactc 


7320 


ctgtgagctg 


ctgagttcac 


agtgacctct 


gctctgccca 


ggccggaccc


aggatgtgag 


7380 


tccccaggct 


ctgaaggcct 


tccaggactt 


ctacccgacG 


ctggggctcc 


ccgaggacat 


7440 


gatggtcatg 


ctgccccagt 


caiggtaccct 


ggcaagcccc 


gcccgcccca 


tgccccccag 


7500 


cctctccc.tc 


ccttcccatg 


agcccaccct 


tccttcccct 


cacagcacag 


gcagtgtgcg 


7560 




CTOl^ O O ^ rf^ 

yy irVip w 


u c c agy gg uy 


ggctggcagc 


ccctgcagcc 


cctaccccac 


7620


ctcccccaga


acagactacg 


ggagaggcfct 


tgggcagcgc 


tgfggagggaa gctctgggct 


7680 


ccacfctctg 


gggatggtgg 


agctgtccag 


tggcccttgg 


cacccagggg 


ttgggggctg 


7740 


cgggaagcga 


gggcggtcct 


ggccgtcagc 


atctcacgga 


tgtcttcctc 


cccacagatg 


7800 


catgcaaccc 


tgagagcaag 


gaggcgccct 


gacacctccg 


gagccccacc 


cccgcccttc 


7860 
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ccaggtgggt tctccaggcc 


ctgcagggga 


tgccctggtt 


gcct^ctcccc 


ticcctaticgc 


7920 


agcttgacat 


ctggttctgt 


ctgccccatg 


cctgccccgc 


tgttctgcct 


ggegacccca 


7980 


tgcttGaggc 


ct cage t tag 


acattacctc 


tr t ccaaaaacr 


Gctcccttga 


1 1 tcct^aaGfcr 


8040


cacccatcac 


ia^agtaccac 


aaatggggcg 


ac ttaaaaca 


gc agaagt 1 1 " 


33 wv..^^^ ^3^ 


8100


agtggctcat gcctatcatc 


ccagcacttt 


QQQaQqcccra 


" cTQcaQQccrao ' 


tcacctgagg 


8160 


ttgggagttc 


gagaccagcc 


tgaccaacat 


ggagaaaccc 


eg tc tc t ac t 


aaa'aatacaai' 


8220 


aacttagctg 


ggcgtggtgg 


tgcacacctg 


taatctcagc 


tacttgtgag 


octicraacicaa 



8280 


gagaatcgcc 


tgaacccagg 


«L93cggaQQt 


tQCQOftCiaQC 


CQaQa 1 1 Q ca 


c c at t a c acft 


8340 


ccagcGtggg 


tgacaagaga 


gaaactccat 


ctcaaaacaa 


a'cagacaaaa 


ccaocacraccr 


8400 


tttatctctc 


ccagttctgg 


aggccagaaq 


tccaaaatca 


agacgtccac 


'*333 ^ ^33 ^ 


8460 


cctctggaiag 


ctctgaagga 


gactccctcc 


cacaccgccc 


ccccagcticc 


cacciactcror' 




gcttgtcctg 


ggcatttctt 


QQCCtQtCiQa 


agcatcagcc 


ctgtctccgc 


c tccat cctf 

w \^ w w Vi^ 


8580 


ccacagccac 


cttccttctg 


tgcatctcitQ 


tccaaactgc 


cccctaatat 




O Ore V 


gaagcgagtg 


ccccacggcc 


ctcttcttag 


gagcatgccti 


oticrctitiaaat 


tcaaacfCfca 

V* ^ w V* wci 


8700 


ctgtaaatgc 


aggatgatct 


aatctccaga 


ttctaaacta 


GrtQcrtatcticr 


caaacfacccft' 


8760


ttccaaataa 


ggccacgttc 


9gaggttcca 




aaci 1 1 1 1 crcia 


oacacoatitic 


8820 


aatccacagc 


tctgtgtccc 


cgctgtatcc 


acacagcccc 


t:ggctccctc 


cccagggcag 


8880 


ggtcctgcct 


tggaattgtg 


ggagccttgt 


cttctccccc 


acTQQCccaaGr 


aocici c aoa c c 


8940 


acagccttcg 


cgcgcagtgc 


ccctcacagg 


agccacgtgc 


QcrqcQQCiQtc 


titacaaaaat: 


9000 


ccccccctaa 


accagacgcc 


9ggagaccgc 


tgggtccctc 


ac ccrcTQac t c 


accaccaaoa 

Vi^^4 Vi« GIL 


9060 


atttgggcac 


agccacacgt 


cacgtgtctg 


acgtgacagt 


caccggcgtc 


tiaaaoaoaca 

3 y ^ y w. V-O- 


9120 


ctggcccttc 


ct^gcatggc 


gggaggaggt 


QCfQCQQCtCt 




totttctcch 


9180 


gcgtttctgc cgtgctgctg ttgcgtttcc 


tgcatctctg 


ctcctctcca


t gage 1 1 cgc ■ 


9240 


dtccactdcc 


aggaccctct 


ccctggagac 


tcgccgtcct 


QCCtQQCraac 


a'c toaQcrcticr - 


9300 


ttcagctttg cacaagtctg; caccagcgtc 


tccctgcaca 


accccQctiaa 


crt tioqtiaaac 


9360 


atggggggca acaccaaatic gcccttgtcc- 


agaaggttct 


■gtgggcagaa 


catgtagccc 


9420 


catcccgcca 


tgatcttgaa gaagcagctt 


tcggggtaga 


gtgccccgcc 


tgggccacca 


9480 


cgcgtgacGg 


tggaggctgt 


gggttctgtg 


gtggcttcat 


g^acattccg 


gggcttttct 


9540 


tgccatgcag 


cccccttcac 


c^9cLggaccag 


gcagaggcca 


tggctcactg 


ccactcagcc 


9600 
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s^gggactggg 


gcatggcact


gggtcctccc 


gcgccgggga 


aggtggggaa gtgctggagc 


9660 


cagggatggc 


ctctggggca 


gcctctcgcc 


tggggcctgt 


ggggcagcag 


tggtagctgg 


9720 


tgtgtgtttc 


caggtttgca 


ggagttttgg 


gagcagaatg 


agctGtccct 


ggcctgtcag 


9780


tggtggcttt 


ggaagagtga 


tgcccagct.t 


gtggggagca 


ggaagggcgt 


ttgtcgcggg 


9840 


agtgggtgat 


ggagtaggca 


tgcgttttct 


acgcaggtgg 


agccaaagca 


gcaggcgcct 


9900 


ttgcccctgg 


agtcaagacc 


cacagcccfec 


ggggaccacc 


tggagtctct 


ccatqctcca 


9960 


ccccccgcct 


gtgggatigcc 


ttgtgggacg 


tctctttcta 


Ctcaataaac 


agatgctgca 


10020 


gcctcatggc 


cctcacctct 


ttggacatgc 


cctgggggag 


ggcaatgcgg 


gcccctgact 


10080 


gaccccagca 


ctgctgggga 


gactgaggca 


gggaggcaag 


tgtgagaagg 


ccctggccgg 


10140 


gtgagcaggc 


cctggggtga 


gagggcaggg 


ggcaggggag 


aggggccctg 


ggaaggggca 


10200 


gggggcaggg 


gtgaagggcc 


ctgggtgagg 


gggcagggga 


caagggtgag 


gggccctggg 


10260 


tgaagaggca 


ggggacaggt 


gtgaggggcc 


ctggggaggg 


gcaggaggea 


gcagccactc 


10320 


ttggcctgtg 


tgggggctgc 


acccacagga 


agaatggagc 


atcctggccc 


agtccctatg 


10380


ggtctgcagt 


gctgtgggga 


actgggcacc 


ctgtgaagac


ggcccggcca 


ctggggctgt 


10440 


gctgggcaca 


caggccccgg 


tgctggggcg 


gggagctggg 


ggagctgcag 


gaggccgggg 


10500 


agttaggccc 


acttgggttt 


tgcccctcac 


cactgatatg 


atcgtgggca 


cctggtttac 


10560 


acccctggaa 


atgggtcacc 


gttgtggcca 


ggggactgcc 


agccagcctg 


agcgctccac 


10620 


agcagaccca 


cagacatagg 


tagctacccc 


gcgtactcct 


cgtttcaaag 


atcaggaaac 


10680 


tggcctggcg 


cggtggctca 


cacctgCaat 


CGcagcactt 


tgggaggccg 


aggcgggtgg 


10740 


atcacttgag 


gccaggagtt 


tgagaccagc 


ctggccaaca 


tggcaaaacc 


ccgtctctac 


10800 


taaaaataca 


agacttaacc 


gggcgtggtg 


gcgcacacct 


gtagtcccag 


ct;act:cagga 


10860 


ggctgaggca 


ggagaatcgc 


ttgaaeccag 


gaggcagagg 


ctgcagtgag 


ctgagafctgt 


10920 


gccactgcac 


tccagcccgg 


gcgatggagc 


aagactctgt 


ctcagaaaaa 


aaaaaaaaa:a 


10980 


aagaaaaaaa


aatcaagaaa 


ccgaggcaca 


gaagggttgg 


gtaacttatc 


caaggctgca 


11040 


cagccaggaa 


gaggccaggc 


aggctgcaag 


cctgggtgat 


caggctccag 


ggccgggctc


11100


cacactgccc 


ccgggtggtg 


atgcctggtg 


cagc^gaggc 


cctccccact 


gtcactgtca


11160 


ctctggtcgc 


tggcgctgtg 


cctccggcct 


ccggagctct 


gactgcccac 


tgcgtctggc 


11220 


acacatttca 


aagggccagg 


cgtggggtgg 


cacaggaaag 


tgctggaggc 


tctggtccgt 


11280


tgcccat:cag 


tgcccacagc 


cctcgcagcg 


caggccagaa 


cgagcccaaa 


aaaacaagtg 


11340 
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tcatgtgggg 


tgggggtcac 


cgagcc.ccaLg 


ggctccgaga 


cggccccagc 


agcgtccacc 


aggctgctgt 


ctaggccagg 


ctaacccctg 


gcgcttgtgc 


agaaacccac 


gtctctgagc 


cccgggtggg 


cgcacaagcc 


tgcactgtcc 


ggaacccgga 


gcccggaccc 


cgccacggag 


ctcgtggtgc 


tccgggcgca 


ggfcggcagca 


gcttctcctt 


cttagttcgt 


ggggctcctc 


ggaggcctct 


cccggtgtgg 


ggagctcgtg 


gttagcggca 


ttttcttctc 


egctgagtct 


caggcccaga 


gaaggtagat 


gccggagacg 


ttttaaaagt 


ctcagcggaa 


gtcccggctc 


ccGctggcgt 


ggttggcagg 


ctggcgagct 


ttctcactcc 


gccgcgccct 


gcgggctttc 


tccgaggtca 


ggcagggcct 


gtgorttcctc 


tccgcagctc 


tagatggccc 


agtggggagg 


cctgaaggca 


ccgcttggag 


ggacacatgc 


tgctttgcca 


ccgagcctea 


cagccatctg 


ccgggggttc 


gtggaatgcc 


cctgggggtc 


tccgtatctc 


tctagattgg 


aggattctgg 


ctggtgctga 


cggcccQgaa 


gcgggtggag 


ctgaccgtga 


aggttgcata 


taacaggtaa 


catgcatggc 


cggggactgg 


gcgtgggttg 


gagtgccgtc 


ccgctggtgg 


gccagcactc 


ccaggctcct 


ggatctgcat 


tttatcccaai 


tgaacttaga 


aggaaggaaa 


aggggtgtgg 


cccaggggaa 


ggccccacct 


ggcacattcc 


ctccacagtg 


ctgcctgtgc 


ccttcctaaa 


caggcgggac 


acaggctagc 


attttaccga 
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gtggtgtggg 


tggagctcct 


cttcetctca 


11400 


gctgtccata 


cgcca^gagg 


gtggctgggc 


11460 


cggtgggcgt 


gggtgtcacc 


agggccgatg 


11520 


tgccagcagc 


caagctgtcc 


tgatgacatt 


11580 


gtatagaatc 


ggcccaggct 


grtgcagcagg 


11640 


gccaggctgc 


cgtgcaccat 


cctgggtgtc 


11700 


gceatggagg 


agctggaccg 


gcagaaggtg 


11760 


ctacaccccc 


aacccctcag 


gctcagggaa 


11820 


gggacgctgg 


tgcccggcta 


gacacttcct 


11880 


gtgccggctg 


ctgggccaga 


ggcacattag 


11940 


aagattcttt 


cctcccgaaa 


atggtagggt 


12000 


tgggccggtt 


gctgagggca 


ggaggcccat 


12060 


gcgctacccg 


agccacctgt 


tctctggcgt 


12120 


ttttccaggc 


ccctgctcct 


gggtcctgcc 


12180 


ccgacatgtc 


gcagaagccc 


cagggactgt 


12240 


ggctgccctg 


tgggcattgc 


tgtctgatgg 


12300 


ctggggacag 


tgggctcaca 


gatgtcttgc 


12360 


ctgacctctc 


agagcccagc 


aggcccctgc 


12420 


tcagacccac 


tgctcagctc 


ttggccaggc 


12480 


agggaagtcg 


gtgtggcctc 


cgaticaaagc 


12540 


ggcttgttcc 


tcaccttgag 


cgggagtaac 


12600 


gaggcctggg 


ggctgctgga 


gaggaagagg 


12660 


ctgcacccgc 


tgcagcgtgg 


ggtctgggct 


12720 


ccggtgcccg 


tctgcctgag 


ccctcgaccc 


12780 








O ^ W 


gacaggtgac 


tggctggagg 


gaaGagaatc 


12900 


ccgcctcatc 


ctgctccagg 


ccctgggata 


12960


gacacagccc 


cactccggga 


aggagcccag 


13020 


tgggcacttt 


gctgcctttc 


agctcaggaa 


13080 
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Qctotaagat 


agagaagatc gtgggctcag. aaatagacag tacgggaaaa ttcgcttttc 


13140 


ctggtaagtq 


cagttgccct gtgatggcag gtggaacccg gctgtgGaca cagctaggcc 


13200 


ttattgttcc 


ccatgctgtt ccctgcactg ttccccatgc tgttccctgc actgttctct 


13260 


gtgctgttcc 


ctgcactgtt ccccatgctg 


ttccctgaac 


tattecctgt gctgttcccc 


13320 


atgttgttcc 


ctgcactgct 


ccctgcactg 


ttccacatgc 


tgtttcctgc actattcccc 


13380 


atgctgttcc 


ctgcactttt 


ctctgcgcca 


ttccccatgc 


gttccctgca ctgttccctg 


13440 


cactgttccc 


catgctgttc 


cctgcgctgt 


tccccatgct 


gttccctgca ctgttcccca 


13500 


tgctgttccc 


tgcgctgttc 


cccatgctgt 


tccctgcgct 


gttccccatg ctgttccctg 


13560 


cactgttccc


catgctgttc 


cctgcaatgc 


tccctgcact 


gttccccgca ctgctccctg 


13620 


cactgttccc 


catgctgttt 


cctgcactat 


tccccatgct 


gttccctgca cttttctctg 


13680 


tgccgttccc 


catgcattcc 


ctgcactgtt 


ccctgcactg 


ttccccatgc tgttccctgc 


13740 


aatgctdcct 


gcactgttcc 


ccgcactgct 


ccctgcacta 



ttccccatgc tgttccctgc 


13800


acttttctct 


gtgccgttcc 


ccatgcattc 


cctgcactgt 


tccctgcact gttccccatg 


13860


ctgttccctg 


cactgttccc 


catgctgttt 


cctgcaccgt 


tccccatgct gttccctgca 


13920 


gggtt ccc tg 


cactgttccc 


catgctgttc 


cctgcacatt 


tcatgcccca gaccttccca 


13980 


ttctcccacc 


aacacactgg 


atcatccttc 


aaaagcttct 


gtagtgtctc caaccactca 


14040 


ag tqc tqgcf a 


ctgggtgggg 


gcaggatgga 


gttagaccct 


gcagaccctg gccttcgagg 


14100 


tccgtccccc 


tcagacgtct 


cccccaacgc 


catggccggc 


tcttgaaggc cacagagaga 


14160 


tccacgtgct 


ggacaccgac 


tacgagggct 


acgccatcct 


gcgggtgtcc ctgatgtggc 


14220 


3993caggaa 


ctttcgcgtc 


ctcaagtact 


ttagtaagct 


tggccctggg gggctctgcc 


14280 


cagctgctgc 


tctcccaggg 


actgcccgcc 


cagcccccct. gtgccccaca gctcggagcc 


14340 


1 1 gaggacaa 


ggaccggctg 


gggttctgga 


agtttcggga 


gctgacagca gacactggtc 


14400 


tctacctggc 


ggcccggc.ct 


ggtgagccca 


ggggccttgg 


ggtggaggct gggctgggcc 


14460 


ctqtgggct'g 


actctgcagc tcctcatgct ggcctatcct gcagggcggt gtgccgagct 


14520


cctgaaggag 


gtgagcttga 


cccccgaccc 


tggcctgtge 


tgaagttccc gggcccctgg 


14580 


cccagtccct 


ggccctgtGa^ ggagcccccg , tggctccgcc 


. tcccggccct gggctgggcc 


14640


ttctcacccc 


ttcctgtgaa 


caggacacca 


aacaccactg gtgggcagct ccagagaitga 


14700 


gtctgtctcc 


tggtttggaa agagctggaa cctccagggt ggtgacccta ggctgccagg 


14760 


cagggaccct 


gggaggctgg ggtcacgggg tgcagagctg ggtgjgggcag gggagcagaa 


14820 
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atggcgcctt 


ttcttcggtg 


tfcccgtgcag 


gaictgcGggc 


tgcttctgcc 


ccGgaaggtc 


14880 


ccgtcggcgg 


cggggcacag 


atcctgcggg 


cgctgcctca 


gggctcccat 


gttgggcact 


14940 


gcgagaaccc 


agtgtctccc 


tcacctcgcfe 


ttgtcttggc 


cctagaggct


gggcGtgtta 


15000 


ccccattttg 


cagattgaga 


aggcgctcag 


ggagctgggt 


gcttt^cgca 


aaaGcaggfqa 


15060 


gcgaggacag 


aagtcccgcc 


gtgtggccct 


catcgaagcc 


Gcgtggggcc 


tcGagagacc 


15120 


acacgggcct 


gagcccctgc 


acttctgtgt 


cgcaggagct 


gattttaatgg 


agttGGtgcc 


15180 


tcagaccaca 


aggttcggag 


cgcccgccca 


cccctgcccc 


tcctgggcac 


cctgcccacc 


15240 


aggtcacctg 


cacctgctct 


gaataaactg 


tgaagtcaag 


ccactgGctg 


gtgtgtcctt 


15300 


ccggagggcc 


gatgggtgac 


aggtgtgggg 


ggactcaccc 


gccccaggtc 


bggcaacagg 


15360 


aggtgtgttt


tccgtggtta 


ctggacaaac 


agtgcggctc 


acgtgcaagg 


cagcaacccc 


15420 


tgccctcccg 


Gcctgctagg 


gtcgtagtca 


ggctgcccca 


gggtatagaG 


Gagagacaga 


15480 


aacagggctc 


ctggtggagt 


ctcagcaggt 


ctgtcaaacg 


catcrqacfQCC 


acacccctcc 


15540 


tgcctgcatc 


ctccatggct 


ccccactgcc 


cagggetcca 


QQCtCaCQQC 


ccciticiGtQaa 


15600 


ggccctaccc 


gatgggcccc 


gcttccggcc 


tccctggacc 


Gccagctgcc 


GagccGtgct 


15660 


cagtgtccgc 


ttccacagtg 


ggctgtcacc 


ccgtggcctc 


ggcfcgGGaGC 


agccgactgc 


15720 


atcactttct 


cctcccgcct 


ggcctgggac 


acgggaaggg 


Gaaggaaagt: 


gacagtgtoa 


15780 


gcccagaggt 


tctgcgccca 


ggcccctggg 


gaatcagccg 


gtccccacat 


atgcaggtga 


15840


gacaacccag 


gcttatgagg 


gtggctgcct 


cgtcggggcg 


gcggaaaatg 


atacctgagg 


15900 


tcgagtggtg 


ccgggagctc 


tgggggaaca 


gcccgagtag 


ggggacGtGg 


ccagccccca 


15960 


gcacggccgg


gacactggaa 


aggccctgct 


aggaacaagg 


gctgcaccct 


agatgcgcct 


16020 


caaatccaga 


actgtgeccc 


cgagacccac 


aggagggaca 


tgtagaggct 


gGattGtgaG 


16080 


aagt:t:agggg 


ctcctgggaa 


acagccgtga 


ctttgcgcag 


ctgtgcctgg 


ccgaagccca 


16140 


cgggttcaag 


gagcttcgcc 


tggacttgga 


tttcctgctg 


Qaqqggctgt 


cactGtcgct 


16200 


gtcagctcag


gctcccgtga 


c aaaat:acc c 


GaQactQQQC 


aGCt:t:aaaGa 


aGa<^aaGtgt: 


16260


atttccbcac 


cgttctggag 


ggtggaagtc 


caagattaag 


gtgcGggaaa 


gtgcagtttc 


16320 


tggtgagggc 


ttcctgccca 


ttcgcagaag 


gcagccGCtc 


gctgtgtGCt 


GaGGtggcag 


16380 


ggagcgctct 


ctgggtgcct 


ctttttctcc 


taataaggat 


atcagcctgt 


taaattaggg 


16440 


ccactccctt 


tgacctcact 


taatctcgat 


tgtctcctaa 


gagtGGcatc 


tcctgaaaca 


16500 


gtgaccatgg 


gatctggggc 


ttcaacgcag 


gaattttggg 


agggGagaat 


tGatGccaga 


16560 
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acaaaagtca gtgctgttta gagggtgaca gtagatgcag ggctgccgtc atcaacaacg 16620. - 

tcctgcccaa aagtcaetgg acgtccaaag acacagaaaa gtatgaccta tgttaaaaaa 16680 

aaaaagaaaa aaaaagttac taaaaactga caccgagaag acccaaagtt gtattcagaa 16740

gataaaggtc taaaacagcc attataagta gttaaaagaa aagcaagata tactctaagg 16800 

aaagaaatag tgacacaata gcattaaagg actaactaat tgataaagaa ttaaggagtg 16860

aacgtaggag atcagatcag gaaatcaaga gagaaatgga aactataaaa aagaaaaaat 16920 

tatggaactg aaaagtacaa taaatgaaaa atttactgga aagactaata ggagttggcc 16980 

atggcaaaaa taatgtccat gaatttgagc acaaatcaat gtaaattatc cagtctgagg 17040 

aacagagaag aaaatggttg atagcacctc aaagatatgt ggggcaagat aaagaatgta 17100 

aactagcctg agaaagagaa gagagtgaga atacatgaag aaacagtggc caaaataacc 17160 

caaacttgat ggaaacatca acttagccca gaggctcaga gaaacccagg caggatgaaa 17220 

aagaaagcct cacaaaaatc acacctgggg acattatagt caaagcactg ccaaccaatg 17280 

atttggataa aaatttaaag ggatctgtgg attaaagaca gggcacatcc aggtttggtg 17340 

gctcacgtct gtaatcccag cactttggga ggctgaggca ggaagatcac ttgaggtcag 17400 

aagtttgaga ccagcttagg caacatagtg agactttgtc tctctttaag aacaaacaaa 17460 

aaagacaggg catgatatag tttggctctg tgtccccacc caaatctcat gtggcattat 17520 

tgcaatgccc aatgttggag gtagggcttg gttggaggtg attggatcat ggaggcagat 17580 

ttccccttgc tggtcttgtg acagtgagtg agttctcagg agatctggtt gtttaaaagt 17640 

gtgtagcact tgccctttca ttctctctcc tgctggccat gtaaagatgt acctgcttcc 17700 

ccttcacctt cagctatgat tgaaagtttc ctgaggcctc cacagctgtg attcctgtaa 17760 

agcctgtgga accatgagtc aagtaaactt cttttcttca taaattaccc agtctcaggt 17820 

agttcttcat agcaatgtga gaatggacta atacagggca taaacagggg gacacaatgg 17880 

ctgtcttctc atgataaatg gaaaccagga gatgacaata ggccatcttt acattgatga 17940 

aaaaaaaaat caacgcagag ctctttatct aggacaaata tccttcaaaa ctgagggtca 18000 

aataiaacata cttttcaaca aabggaagtt gagggaattt atctctagca cagcttcagt 18060 

acaataaata aatgccaagg atgttcctta ggcagaagga aatgactctg gatggccact 18120 

caggtctgca ggaagaagat ccgtgtgaca agtggggacc aagtgagccc actggaacct 18180 

gcattccttg ccctccattt ggttacttaa ttatatcagc atggactcag cagttcctgt 18240 

tttcttccat aggttacaat ttatgacaat catttctacc caatgggagt gccttcaaag 18300 
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tggcttcaga gtgttgttgg 


tacattccct 


gattccttga 


gcacattttt 


gctttttggc 


18360 


acaaaaaaca 


ttccaagttc 


ctcttgcatt 


ttctgtacac 


aggctttgaa 


taatt caeca 


18420 


gagtcctggt 


tccttttcat 


atagaatggt 


gtgtggagaa 


accaagctct 


gagtgttggg 


18480 


tgtgctttta cttttactgg 


gtccgafctgt 


atctaaacct 


tctcaacaca 


cagagcctgg 


18540 


actttccttc 


ctcctcttcc 


tccctccttc 


ttcttcccct 


ctcctgcctc 


CGtctctctc 


18600 


tctccttctt 


cccttcccct 


cccatttetc 


cctccctctc 


tccctccttc 


ctctcctccc 


18660 


tgccctccct 


ccctcccttc 


cctccttcct 


cccctcctcc 


ctctctcctt 


ccctccctgc 


18720 


ctctccccat 


gtgtctacca 


aaattggagt 


tcacatggat 


atttccaatt 


ctaatcagac 


18780 


gccacagggt 


tcactttgat 


ctctcccctt 


tctatatgtc 


tatatttata 


attctctttt 


18840 


ccaacacaga 


gaaacccatt 


atccttgata 


tatttactca 


tttggtccct 


tctagagcac 


18900 


acggaaagta 


gtabcagaat 


tgctcaccca 


aaccactgtt 


agtcatgaac 


ctgcccatga 


18960 


gagcttagga 


tgtgtttaca 


gctctttGtg 


ctgttagctt 


gagaatacag 


agaaaaacta 


19020 


ctgtgttcag 


aagttacGag 


gggcagtcct 


ttccccttca 


gtgtggctgt 


catttatcta 


19080 


"taatgcaact 


ataticccatg 


tttggaattt 


gtttatttta 


cttttgagta 


taacagtata 


19140 


acataccaaa 


ttactttgat 


tccaaagtca 


aaactataca 


aaaacgttta 


ctcaaaaaag 


19200 


tctaacttca 


tccccccacc 


ctgttccagc 


tccccacccc 


agcagggaac 


tagtctcatt 


19260 


agactctgag 


gtttatcctt 


tgtttttgtt 


tgtttgtttg 


tttgtgtgtt 


tgtttgtttt 


19320 


tttgagacgg 


agtcttactc 


actctgtcac 


ccaggctgga 


gtgcagtggc 


gcaatctcgg 


19380 


ctcactgcaa 


cctctgcctc 


ccgggttcaa 


gcgattctcc 


tgcctccgcc 


tcccgagtag 


19440 


ctgggattac 


aggcttgcgc 


caccaccacg 


cccggctaat 


ttttgtactt 


ttactagaga 


19500 


tgggertttca 


ctaagttggc 


caggctggtc 


tcgaactcct 


gacctcaggt 


gatctgcccg 


19560 


cctcggcctc 


ccaaagtgct 


gggattatag 


gcatgagcca 


ccacacccgg 


ctgatgtagc 


19620 


gtgctgtttt 


gtggctgcct 


cgagaaccag 


igctccccgca 


gcccttaggg; 


acactagtgg 


19680 


ggctggcagg 


tgtttccttc 


tcagaggcct


iggcccagctg 


ctcggcttcc 


tcctttttgt 


19740 


atttttattt 


tatibttagag 


atiggggtctt 


gctccatcac 


ccaggctgga 


gaggagtggc 


19800 


acgat:cacag 


ctcactgcag 


ccttcaacct^c 


ctgggctcaa 


gcaatcctcc 


tacctcagct 


19860 


tcctgagtag 


ctgggactac 


aggageacgc 


caccatgtcc 


agctaatttt 


taaacttgct 


19920 


tgtagagacg 


aagtcttgct 


atgtcgccca 


ggctggtctc 


aaactcctgg 


actcaagtga 


19980 


tcctcccgcc 


tcagcctcct 


gaagtgctgg 


gattaccgcg 


agcagccacg 


gctccctcct 


20040 
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gaaggctcca ggcctaagtg tcccgctccc ttgttctgtg gctcagcccc bgggcaactc. 20100 

ctgcttgcca tggctgtggc tggcaGCtcc gtggccccac tgcccggtca gtcttcgcag 20160

gGccagcccc cagtagctgt gtggtttctg ccttcctcct gcactctggt ggtcacaggc 20220 

cctgctgcgt gggtacaggg caggtggcca gggctgcagc ataagctctg tgagtgcagg 20280

cggccctagg cactggagcc ttgttgcatg gacgtgctcc ttgcacectt gttgatgaat 20340

gagcgaaggt gtgtgggagt gaatcccaga gggtgcaggc cacggcactg ggggaggacc 20400 

ctcccggctg cctcccagtg ggcttcaggg gctcagatag accctgccga gcacctctgt 20460 

cctctgctcc cggactgggg ctgcagagag ctctcaaggg gccttgggag gcacaagcag 20520 

gccagggtcc cagggagggg aggcaggagg cagcagggcc ggatggtggg ctgeagaggt 20580 

ggaaggaaca ggacagtctg ttctgggggt gctggctgtg ggggacagag gccaiggGcac 20640 

aaagcagaat ggtgggtggg ccaggaggag gtgggagcct ttttcaaaag gctttcaggg 20700 

actttttata tttaatgatt gaaacaaaag atatccaaaa ccgcagttcc tgagaaccac 20760 

ttgttctcgc cctggttcct ctccttgggc ttgaccttgg cctcctgccg ggcctggcag 20820 

cggacagcag tctccctgct tcccaggctg ggcgagggag agcgtgggtc cccaggaacg 20880 

ggggaggtgg tgcggctggc tttaggatgc ccattctgga accttctggt ggaagaaagc 20940 

aggggatcag tatagtatgg ttgatccaca cactccagac cctctcctgg ctgtgcagct 21000 

ggtctgatct gggctcagag gctgccactg ctcagtgctc ccctgggtaa ttttgcagga 21060 

ggccaggagg gttcctggca gttagtgcca cccacacgga gaaattcaga gccatataaa 21120 

cgggtctcca gggcctggag ggactgcaca tcctgggctt geggcgcagb gtagacctgg 21180 

gaggatgggc ggcctgctgc tggctgcttt tctggctttg gtctcggtgc ccagggccca 21240 

ggccgtgtgg ttgggaagac tggaccctga gcaggtacag tcctcctggg ggtggggaga 21300

gctggtcctc gggggccagc ccctccttta aggccacaca gcttctgggc cccccagggc 21360 

tagcccagac cagcatgagc agtggggaat tagttgggcc agcccttggg gagtcccaca 21420 

ggcaggagcc tcagggcagg aggggtggat gctggagggt tggaggctgg agggctggag 21480 

aattggaggc tggagactgg aggttgcagg gctggagggt gtagggttgg agaactggag 21540 

gctggagggc tggaggctgt agggctggag aactggaggc tggaggctgc tgggctggag 21600

gctggagggc tgtagaactg gaggctggag ggctggaggc tggagggctg cagaactgga 21660 

ggctggaggg ctggagagct ggagactaga ggctgcaggg ctgcaggctg gaggttggaa 21720

ggctggaggg ctggagccta gaggctgtkg ggctggaagg ctggaggctg gagggttgga 21780 
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gggctggaga 


actgaaggct' agaiggctgca 


21840 


agggttggag 


actggaggct 


ggagggctgg 


21900 


gggctggaag


gctggaggct ggagggttgg 


21960 


agggtggagg 


ctgaggctgg siggtttg^ag 


22020 


tggaggctgg 


aggctggagg gctggcggcc 


22080 


gggccctggt 


acgtgcttgc ggtggcctcc 


22140 


atgaagaacg 


tcgtgggggt 


ggtggtgacc 


22200 


tcctctcagc 


acgggtgagt gggcgggtcc 


22260 


ctcagccaca 


tcatgtactt 


tgcacatctg 


22320 


acggcccctc 


tgcacccccg 


tcacttttcc 


22380


ctgcccagct 


actcacatct 


gtgtccctgg 


22440 


ccgcggcggg 


cactgctctg 


actatgctcc 


22500 


ctgcaggctg 


tggtcagagc 


tggggaagcc 


22560 


ctgtttaaag 


gaggataatg 


cattcggacc 


22620 


tccccaaaca 


atatggtcat 


ggacgctgtt 


22680 


cgcacccgac 


cacafcgcatg 


gctggtggcg 


22740 


gggcggccgt 


gctacacagg 


ggacacctac 


22800 


tggggcagga 


gtcggtgccc 


gcctcccttc 


22860 


gggtgtctac 


cccctggcag 


gctggcgaca 


22920 


cggggcccag 


agagaagaac 


ctgccctcct 


22980 


ctgcagggta 


tgggcagggg 


CGCcactgca 


23040 


gcfcgggacca 


aggggctcca 


ggctgaggtg 


23100 


ttgctgttcc 


tcccgcatgg 


actcagcctc 


23160 


ctttgctggg 


tttfcgctcac tct:ggggact^ 


23220 


cgagtcgcag 


gcaaaccccc 


accccgcccc 


23280 


accccacccc 


atccttcccc 


tgagtcacgg 


23340 


tgagttgcgg 


gcgaaccccc 


accccgcccc 


23400 


accccacccc 


atctttcccc 


cgagtcgcgg 


23460 


gtgtcctcag


cagcattcag 


cgttcctggt 


23520 
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ggagatctgg cactgaggat ctctcaggaa. gtgaatggga gtgtccgccc ctctcaggac .23580 
tcccacgccg tccactctcc gagaacaggt ttccgggcat cggggcatct ggcagaagat 23640 
ggaagctctg gcctcactgc tggctggcct ggcctggact cctgagctcc gtctcctctc 23700
accgggcccc ggggtcttga ccctgagtgg gtgacaggcc ecttctltttc caggctggga 23760 
gggtgtgacc agagtgtcat ggacctgata aagcgaaact ccggatgggt gtttgagaat 23820 
ccctgtgagt ctgacggcca cggcctcacc caggccgttc ttccctgcag catccccagg 23880 
ccccctgcgg gcagggacgg agggctgact Gcgctcccga gagccaggag gctacaggtt 23940 
ctggtcttgg gattgctgcc catcctgagg gtgaagagaa gcccctggac agaggattgg 24000 
aaccttgttc ccagcaggct cagacaccag tgaggtccag ccaaggtccc cataggtgca 24060 
gagggcgagg ggcctgttta gtggctgtct acaggtacag ggctgggctg gtgtgaccca 24120 
gccgctcctc gttcacacct gggacaggct ggggcagtgc tcgtctgtct gtccgtcagt 24180 
gtctgaaagc cccttgatgc tggcaccagg tggggtcggg ggtgggggac aaagcttggg 24240 
ggctctgggt gtgcttggag gggtctcctg gggagggggg ctctcacttg aattggcccc 24300 
tctgtccctg tggccagcat cagctcagcg gcccctggat ggatacagca gctgccctcc 24360
cctgaggcgg ggggttttgt ttcctagcaa taggcgtgct ggagctctgg gtgctggcca 24420 
ccaacttcag agactatgcc atcatcttca ctcagctgga gttcggggac gagcccttca 24480 
acaccgtgga gctgtacagfc aagtgtgctg tgcggggccc acgtagggag ctgtgtggtg 24540 
cgtgggccgc tcggggccca cgttgggagc tgtgtggtgc gtgggccgct tgggggccgc 24600 
gtggggagct gtgtggtgag tgggccgctc gggggccatg tggggagctg tgtggtgagt 24660 
gggccgcttg gggccacatg gggagctgtg tggtgagtgg gccgctcggg ggccatgtgg 24720 
ggagctgtgt ggtgggtggg ccgcacagga ccacatgcag gccaagttcc cccaccttga 24780
ggctcgctgc actggaggtg gctttggaag gaccggggcc agcatcccag ggcaggtggc 24840 
ctccttcctg acccttccca gcccctcccc ggcccctccc cagctctcag aggttgcttc 24900
cccctgcact gccctggtgc ccccaggtct gacggagaca gccagccagg aggccatggg 24960
gctcttcacc aagtg^agca ggagcctggg cttcctgtca cagtagcagg dccagctgca 25020
gaaggaccgt gagtgtccac cggagcagcc tcgggggagg cttggggeaa gttctggagc 25080
ccacagggcg tgatggggtg acagagaccc tggggttgat tctggcttcti gcctgactca 25140
gtcgccctga aggagtcagg caggctgagg gggtcttcac ctgcccctgc ccaaagtcag 25200 
gcctgggggt cttgtcccca cggcaggcga gtcttgtggg tggggccctg gggtgctggg 25260 
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gggctcagtc ctttttcaca gccagtcttg cctccccttc t^gtcacctg tgctcacaag 25320 

atccttctgg taagccgcca tcctgagcct caccctgggc tcfccttgggg gaagggggtt 25380 

ggggaggcca ccctacgcac agtgggtggg aggaccctct gcccaagccG acggggttca 25440

tggctccact gtctctaagg cagctgaggg tgtggaggag cctccatggt gggctgggcc 25500

ccctcacact cctefctggtt ttcagtgagt gctgcgtccc cagtagggat ggcgcccaca 25560 

gggtcctgtg acctcggcca gtgtccaccc acctcgctca gcggctcccg gggcccagca 25620 

ccagctcaga ataaagcgat tccacagcaa accaaggatg cttttgactg ggggccagcc 25680 

ggggaattgc ggggaggatg gcgggggtcg tcaccaaggg ccaagccaca gaaccataga 25740 

gccagctgca aagaagacgc gtgcccaacc cagtcccttg ccagagcccc tctgggtctt 25800 

cagacaccca aactcaggcc cgggctcaga ggcgggggct ggtcagagga cacagcctgg 25860 

gaggaagcca gcagtgagct ccaccaggac cgcagctggg gtgccagagg ctccaaggac 25920 

tgcccaggac aagagagagg tggctggtgg ggctcagagc agcccctgac tccccaggct 25980

caggcttctc tgggctgggg ttgtgtctct tgcctgggca ggggcaggga ccccagacac 26040

ctgagcctgc aggfccctcct gtgttctgat gggtgtgggg ggctgcgctg cctgccgtcc 26100 

tgggagaagc acaggccatg ggggtctcgg ggctggatgg ggggctgggg actgtgtgag 26160 

gctcatgctg gagtgaggcg aggtgcacac agatgccttg acctgcaagg ctgggcatgc 26220 

acacgcatgc acatgtgagc acacacccac acatgcacac acacacattc atgggcacgg 26280

gcacacagcc ccggcccgcc cttcggcacc tgctcaggtg cttggtgcca ggccctcaag 26340 

aaggtggtgg gggctgtgtg tgtggctgcc gcgcagagct caccatgtcc acatctgccc 26400 

cagaagggcc acgaccagcc ctttgcctgt cacctgggag gtctaggatg agggccaggg 26460 

cttaatgagt ctcaggggca cacacagtga ccccaggccg ggcactgagg gcaggagggt 26520 
cctggggcct cagaggggcc tctgtctgcc agggcagccc aggGcagcag tgaccaccag 26580

gtggggcagt gccggcttcc ctggagtgtc tgggggtggt ccctgggcgg ggagagggta 26640 

gcagagccca aggccacacc tgttgcctcc ctccgctccc tcccctccct cctctcccga 26700

cacaggctgg cggctccaga gaggtttaad cactggfcctt ggeggctgag ggaggagggt 26760 

gaagatgagg caggggctgc tggtgctggc gctggtgctg gtgctggtgc tagtgctggc 26820

tgcagggtcc caggtgcagg agtggtaccc cagggagtcc cacgccctca actggaacaa 26880 

ggtgagccat gtgccactgg tccctgagcc gggcgccggc tctggagctg gagcaggctc 26940 

ctgcacagtc tcccggcagc cgcccaggag cagggctacc agcagttcca ggcacgggcg 27000
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ctgcccacct ggcctctccc actgcagcct c.cagcaicagg . ggccccgcaa gttgcagggc 27060 

aaaggcaggg agcctcgagg gggttcgggg gcccaggggg gtcagggagg gccagggtgg 27120 

aggggcccag agagctgtgg ccgaggaggt cgggagacgc ccagagtgca gagtgggtgt 27180 

ggagcaagcg gggcatattc tgtagcggga gggcccgggc acctgctgag tgagccgtgg 27240 

gagcagcact cattggcacg gcggtgccca gggcgctagg gttggcagca ggcagagttg 27300 

ggggccgggg gccggcagga agggaagtgg ccgagcagta cccagccccc aggaggccat 27360 

gtgggaagga gccggggcct ctgaggtagg gggcctccGa cctgctggcg cccacagccG 27420 

agccctttgc tgcctggggt ccctgaccag ccgggcacct gcttctctgc agccctgctc 27480 

cagtggccag ggcttgagtg gccgggaccg aggcggcacc tgggcggagc tgggactccg 27540 

gctcacagaa gggtcatttg tctcctgggt tctctagctg ggaacctcgg gggtggggac 27600 

agcccccggt ggcagctcct gtccccgccc acacatccgc tgcgcagttt tcagggttct 27660 

ggtacattct ggccactgcc actgatgccc agggattctt gccggccagg gacaagagga 27720 

agctgggggc gtccgtggta aaggtgaaca aagtgggcca gctccgcgtg ctcctcgcct 27780 

tcagacggtg agtgggcagc gccccccagg caagggtctc cccttccagg aggagggiigc 27840 

tgcctcgggg ctggaaagga gtctgggagg agcctgtctg tctggaagtt ccacaagaac 27900 

cgtcccccca cttcctcccc ctaaactcag aggaagccac ctgcagcccc tgggacccca 27960 

catgggtggg tgttgggagg aagctgcctg cagaccttgg gaccccacat gggtaggtgt 28020 

tgggaggaag ccgcttgcag ctcctgggcc ccacacgggt gggtgttggg agtttctctc 28080 

gtgccgtggg acaggcccca gtccgaattg gaaaatcctic aaaccacaga gcctcagatt 28140 

gacacgagtt ggatagtgaa gcaaaaaact ggggacctgg ggccccaacc ttcccgggtc 28200 

ccacgggagc aggggcggga ccgagacgtg tgggtggggc tcggtggctg gcaggcggga 28260 

ggaggggttc cctggtctgc agcctgagcc cccgccttgg ccccaggttg aaggggtgcc 28320 

agtcccagga ggtgatcctg aggaaagacg ggaagaagcc ggtgtttggg aacgcctgtg 28380 

catacgcggc gggcccgagg gaaggacagg aigggaggtgt gcttgggctc tgggcbctgg 28440 

aggagctgcg tgatatgcct ctgaccccGc tgtgciictt^ai gcaggcccct tgtccccgag 28500 

agcacgtggc aggcaggcgt caggctggct gtgttcccag agcccacacg tcatgtgctg 28560 

ggcggcctgg cccgggaggg gaggccccgg gagggcactc agtccctctg gacacctgca 28620 

cctgggtggg ggctctgagg cctcgggcac caggggtcag ggctgtgggc gggcacagcc 28680 

acgccatccg ggaaccagcg ggcatctctg ggcatctctt tcagtgaaag gggtgaaggc 28740 
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cttccacgtg ctgtccactg actacagctja cggcttggtc tacctccgcG tggggcgtgc 28800 
aacccaaaac tacaagaacc tgctgctctt ccgtgagcag gcacagcgga tggcagagga 28860 
tggtggaggg ggtgcagagg gaggcataga ggggcgctgg gcacaggggg ccgggcagga 28920 
ggtgaggcgg gcagggagag tgggggttgg ggcggtggtc agggcagggg caLatggaagc 28980 
cgagaccaca gcagtcaggt cctggaggtc gctgaagcaa ggatagggga cgtcaggaag 29040 
aggctgcaga gaaaggtggg ccatdcaggg ggagcggiagg ccaccaggat ggtggcgggt 29100 
gcaggtgcaa cgttctgggt gagggacaca gtggggatga ggaggcgtgg gaggggtccc 29160 
aggtaagggt ctgcccggga tggaggggtg tggaagtgac ccagtgtggg ttaagggcca 29220 
caaacccaat tctagacacc aaccacttgg ggtacacgga aaggagtggc cctcccggtt 29280 
ggaacgggta gaggggaggt ggggtcctgg ggtctcagtc gaggcagggc acgctcaggg 29340 
tgcagcaagc tggggcacag ctgaggggct ggctggaggg gactctgcag gcagagaggg 29400 
ttccccaggt gacagcgtgg tgaccctggc ttcccgagcc tggctgcctc gtaagtgggg 29460 
aggggcccag cggccgctgt cctcccctgg ccaacccctg cttcacgcct aagctctgag 29520 
gacatttcag cagctcctgg ggtagggatg ggggtcaggg tgcagaccct aaaatggcca 29580 
gagctggagt ctctgatgtg atcctgatct caacctcaga taggcagaat gtttcgagct 29640 
tccagagtct gaaggaattc atggacgctt gtgacattct ggggctctcc aaggccgccg 29700 
tcatcctccc gaaagacggt aagctgtgcc ctcagccctt tgccctcctg gccctacctg 29760 
cccgtcccat tggggagctg attcattggg ggaaggagag atgaaaggat ccatttgaaa 29820 
ggtgccattt gaaagatccg cccaaacgtg cctcccacct cctgccgccc ctgagaatcg 29880 
gggcttggcc tggcaggcct ggcaccgaca gcgagaggcg tccggagcag atgctgcatc 29940 
cgaccaggcg atgccgtggc tgtggctgca ggatcggggg ctgcatggct gtgttctcgt 30000 
ggtgcccaag acgagcacgc atggaaccag ggagcggccg ggggagacgt gttcacggct 30060 
catgtgtctc ctccaagcag ccagcccaca tgtggctgtt ttcctctccG acagcgtccc 30120 
gtacacacac catcctgcca tgagagccag cgtcctctgc tcttcccttt cgacgcgtgg 30180 
tcctcccaga cccgaggagc gttgttccag atgtcaagtg gjagaccacgg cttgttgaga 30240 
ggtgtttggt gactctaggt cattgacagt gtgaaaatca agtggttcaa gggatccttt 30300 
atgatcacag aaagatgggg gaaggaggat acaggctgac cgggtggcga gatgtcagcc 30360 
aggccccttc cccactgtct tctggagcac actgcaggct gctgctattc cctgtctgct 30420 
gtgaaacaga ccacaggccg gcccaacgca gtcctcgccg tgtgccttgc ctctgccctc 30480 
cacgccgccg ccaccctccg cgttccttlta catcctgcaG tttctcagga agccgttcgc 30540 
atatccaccc aacatggagg agcGcceatg acgtgccagg cagcgtgctg gcccctgcgg 30600 
ctgcgttagg gaacagaatc ggcaaacgca gtcccagagt tgatgcctga tcaggagaag 30660 
ccaacttcag cagtgtggtc acctggatac tcgagtatag accctagatc acctggatac 30720 
tcgagtatag accctagaag tgtatctagg cgcggccagg gcccdacaac agcaggtctg 30780 
actcattcca tccagggcag catcagggag agcttcctgg gggaagtggc atttgggctg 30840 
acctgggaag ggtggtgggt gttaa 30865 



<210> 3 

<211> 531 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (1)..(531) 

<400> 3 


atg atg tca ttc ctg 48 
Met Met Ser Phe Leu 
1 5 10 15 

acg gct cag gct gag gtt ctg ctg cag cct gac ttc aat gct gaa aag 96 
Thr Ala Gln Ala Glu Val Leu Leu Gln Pro Asp Phe Asn Ala Glu Lys 
20 25 30 

att gga gga ttc tgg agg gaa gtc ggt gtg gcc tcc gat caa agc ctg 144 
Ile Gly Gly Phe Trp Arg Glu Val Gly Val Ala Ser Asp Gln Ser Leu 
35 40 45 

gtg ctg acg gcc ccg aag cgg gtg gag ggc ttg ttc ctc acc ttg agc 192 
Val Leu Thr Ala Pro Lys Arg Val Glu Gly Leu Phe Leu Thr Leu Ser 
50 55 60 

ggg agt aac ctg acc gtg aag gtt gca tat aac agc tca gga agc tgt 240 
Gly Ser Asn Leu Thr Val Lys Val Ala Tyr Asn Ser Ser Gly Ser Cys 
65 70 75 80 

gag ata gag aag atc gtg ggc tca gaa ata gac agt acg gga aaa ttc 288 
Glu Ile Glu Lys Ile Val Gly Ser Glu Ile Asp Ser Thr Gly Lys Phe 
85 90 95 

gct ttt cct ggc cac aga gag atc cac gtg ctg gac acc gac tac gag 336 
Ala Phe Pro Gly His Arg Glu Ile His Val Leu Asp Thr Asp Tyr Glu 
100 105 110 

ggc tac gcc atc ctg cgg gtg tcc ctg atg tgg cgg ggc agg aac ttt 384 
Gly Tyr Ala Ile Leu Arg Val Ser Leu Met Trp Arg Gly Arg Asn Phe 
115 120 125 

cgc gtc ctc aag tac ttt act cgg agc ctt gag gac aag gac cgg .c^ 432 
Arg Val Leu Lys Tyr Phe Thr Arg Ser Leu Glu Asp Lys Asp Arg Leu 
130 135 140 



ggg ttc tgg aag ttt cgg gag ctg aca gca gab act ggt ctc tac ctg 480 
Gly Phe Trp Lys Phe Arg Glu Leu Thr Ala Asp Thr Gly Leu Tyr Leu 
145 150 155 160 

gcg gcc cgg cct ggg cgg tgt gcc gag ctc ctg aag gag gag ctg att 528 
Ala Ala Arg Pro Gly Arg Cys Ala Glu Leu Leu Lys Glu Glu Leu Ile 
165 170 175 

taa 531 

<210> 4 

<211> 176 

<212> PRT 

<213> Homo sapiens 

<400> 4 

Met Met Ser Phe Leu Leu Gly Ala Ile Leu Thr Leu Leu Trp Ala Pro 
1 5 10 15 

Thr Ala Gln Ala Glu Val Leu Leu Gln Pro Asp Phe Asn Ala Glu Lys 
20 25 30 

Ile Gly Gly Phe Trp Arg Glu Val Gly Val Ala Ser Asp Gln Ser Leu 
35 40 45 

Val Leu Thr Ala Pro Lys Arg Val Glu Gly Leu Phe Leu Thr Leu Ser 
50 55 60 

Gly Ser Asn Leu Thr Val Lys Val Ala Tyr Asn Ser Ser Gly Ser Cys 
65 70 75 80 

Glu Ile Glu Lys Ile Val Gly Ser Glu Ile Asp Ser Thr Gly Lys Phe 
85 90 95 

Ala Phe Pro Gly His Arg Glu Ile His Val Leu Asp Thr Asp Tyr Glu 
100 105 110 

Gly Tyr Ala Ile Leu Arg Val Ser Leu Met Trp Arg Gly Arg Asn Phe 
115 120 125 

Arg Val Leu Lys Tyr Phe Thr Arg Ser Leu Glu Asp Lys Asp Arg Leu 
130 135 140 

Gly Phe Trp Lys Phe Arg Glu Leu Thr Ala Asp Thr Gly Leu Tyr Leu 
145 150 155 160 

Ala Ala Arg Pro Gly Arg Cys Ala Glu Leu Leu Lys Glu Glu Leu Ile 
165 170 175 



<210> 5 

<211> 5159 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> 5'UTR 
<222> (1)..(5159) 
<400> 5 

tgtttatttt tattattttt tttttttaaa tttatttaac ataattttta ttaataataa 60 

taataaatat agtttgtaaa ttttaaaaat aaaagagaag agataaagaa tagattaaat 120 

gaattatata aatgaattaa ggtaatactc aatcctttgg atatgtggga ttatacacca 180 

tataattttg ttatatttaa tattagtata tatggtaggg tgtatgtgag aaaaggtaaa 240 

acttaaaaaa gcggtgaatt acaagatcgg taacgaaggt cgaaaagaaa ggaaatagcg 300 

ggtgtctggt gatgtcttgg atttgagggt cgggggtttc gaactctttt taaggtccat 360 

tatttaggcg cggcggggag agaggtatgt aaaaccgcaa gatggggtcc ttatcaggat 420 

aaggaccacg agtgactgtt agcagaaagt aatatcatga ggaaacagta aaaaaaatac 480 

ttatgagttg ggttgggagg ggactgagct ctgagaaagg accccagttc actatgtcaa 540 

aaacgaaaaa aaggcaaggc gcagtggctc acgcctgtaa tcccggcact ttgggaggcc 600 

aaggagggca gatcacctga ggtcaggagt ttgagaccag cctgaccaac atggtgaaat 660 

cctgtctcta ctaaaaatgc aaaaaaaatt agacgggcat ggtggctact gttatcccag 720 

ctacatggga ggctgaggca ggagaatcgc ttgaacccgg gaggtggaga ttgcagtgag 780 

ctgagattgc accactgcac tccagcctga gtgacagagt gagaccccgt cttaaaaaac 840 

aaaacaaggc caggcacggt ggctciatgcc tgcaatccca gcactctggg aggctgaggc 900 

aggtggatca caaggtcagg agatcgagac catcctggct aacatggtga aaccctgtct 960 

ctactaaaaa tacaaagaat tatcccagca tggtggtgga cgcctgtggt cccagctact 1020 

ccggaggctg aggtgggaga atggcgtgaa cccaggaggt ggagcttgca gtgagctgag 1080 

attgcgccac agcactccag cctgggcgac agagcgagac tccatctcaa aaaaagaaaa 1140 

aaaaaaaaaa aaaaaaaaga aaaaaaagaa aaaagaaaaa aagaaaagaa aacgaagaaa 1200 



aaccacaaat aaataaaagt tgaggtatga tgtatgcacat gctgcattcg cccctttgag 1260 

ggcacagttg tgtgagttta gacaaatgca aatgttcatg gaaccictgc aagtgcgaca 1320 

cagcagctct ggtgccckca aagcttcctc gaggtccctt gcggtcaatc actgcccgca 1380 

cctcagccct cagagccact gggaagcccc ctgccctigftg ggctcacctg tcctagaacg 1440 

tcgtataaaa ggagtcgcag atcgcagcct tttccacctg gcttctgtcc ctctgcaggc 1500 

tgcacctggg attcagccac cttcgttcct tttactgctg agtgtccatc acgtgctgct 1560 

catgcagaag tgctcaagtt cgctgtaaac gtagttttca tttctcccgg gtggagacct 1620 

ccataagtag gtaggttggt ggaatgtttc actgcatgag agtctgccag gaagttttgc 1680 

agtgtggtcc cgtcttgttc ttcgcgcaga gggtgatagc tctgctgact cccagccctc 1740 

acaggctcta gggattgagc ctgtttctgt ggaacgcctt ccatggtctg cgaggtgcac 1800 

agaggggtct caaaaagctt ccaggttgca tcttcccatg gactgatgct gagcgtgttc 1860 

tcatctccca tccggatgtt ttctttggtg gagtatcttt tcagatctct tgccttttta 1920 

tttttatttg tttatatata tattttgaga cggaattttg ctctttcact taggttggag 1980 

tgcaacggcg cgatcttggc tcactgcaac ctccccccac ccacctcact ttgggttcaa 2040 

gcgattctcc tgcctcagcc tcccgagtag ctgggactac aggcgcccgc caccatgccc 2100 

ggctaatttt tgtattttga gaagagatag ggtttcacca tgttggccag gctggtctqg 2160 

aactcctgac ctcaggtgat ccacccgcct cggcctccca aaatgcgagg atcacaggcg 2220 

tgagatgcca cacctggcct ctttttattt ttaatggagt ggcttgtttt cttattactc 2280 

agtcttggga gttcctcatg gatgctggat ttcagtctcg tctcaggtgt atgcgttttg 2340 

tagatgtttt ctgtagggac cagccccaca gggttggtga gtttctccct gtgtgctgag 2400 

atgagagggc atagaaataa ggacacaaga aaaagacata aaagaaaaga cagctgggcc 2460 

tggggaccac taccaccaag acgtggagac cggtagtggc cccgaatgtc tggctgtgct 2520 

gatatttatt ggatacaaag caaaagggac agggtaaaga gtgtgagtca tctccaatga 2580 

taggtaaggt gacgtgagtc acgtgtccac cggatggggg gcccttccct gtttggcagc 2640 

caaggcggag agagagagag agagadagct tacatcatta tttctgcata tcagagactt 2700 

ttagtacttt cactaattga ctactgctat ctagaaggca gagccaggtg tacaggatgg 2760 

aacatgaagg cggactacga gcgtgaccac tgaagcacag catcacaggg agacggttag 2820 

gcctctggat aactgcgggc aggtctgact gatgtcaggc cctccacagg aggtggagga 2880 

gtagagtcct ctctaagctc ccccggggga aagggagact ccctttccct gtctgctaag 2940 

tagcaggtgt tttcccttgai cactgacgct actgctagaLt cacggtccgc ttggcaaccg 3000 

gtgtcttccc agacgctggc gtcaccacta gaccaaggag ccctctggtg gccctgtccg 3060 

gcgtaacaga aggGtcgcac tcttgtcttc tggtcacttc tcactatgtc cGctcagctc 3120 

ctatctctgt atggcctggt ttttcctagg ttataattgt agagcaagga ttattataat 3180 

attggaataa agagtaattg ctacaagcta atgattaata atattcatat ataatcatgt 3240 

ctatgatcta gatctagtat aactcttgtt attttaitata ttttattaaa ctggaacagc 3300 

tcgtgccctc ggtctcttgc ttcggcatct gggtggcttg ctgaccacag ttttcctgct 3360 

gtatctaact gctggcttag gtgaacctgt caggggcgtg gctgggaact ggaacctcct 3420 

gaagaccagt ggaggccaca agcagaagcc ctcggctcct tcccttcccc gatgctgggg 3480 

gccaagggtc tttctctgac cctgtccccc cctcacgtta ccagtggagg gtgtccaggt 3540 

tcttggcatt ttgaacaaag aattggacaa aacacacaaa caaagcaagg aaagaatgaa 3600 

gcaacaaaag cagaaattta ttgaaaatga aagcactctt tacagggtgg gagcgggctg 3660 

agcaagcagc tcaagggccc cggttacaga attttctggt gtttcaatat cctctagcgg 3720 

tttaccattg gttacttggt gtatgcccta tgtiaaatgaa gaggatgaag tcaaagtcat 3780 

tttctcggct gagcatactg tcacgcctgt aatcccagta ctttgggagg ctgaggtggg 3840 

tgtatcacct gaggtcggga gtrtcaagacc agcctgatga acatggagaa accccgtctc 3900 

tactaaaaat acaaaattag ctgggtgtgg tggcgggtgc ctgtaatccc agatagtcag 3960 

gaggctgagg caggaaaatc gcttaaaccc gggaggcaga ggctgcagag agccaaggag 4020 

tgcaccactg cactccagcc tgggcaacaa gagcaaaact ccatctcaaa aaacaaacaa 4080 

acaaaacgaa gtcatgtact cggtatgagc cctiacgtaaa cggagaggat ggagtgaagg 4140 

tacaaagcca ttcacattcc cgtcatcgtt agtgtttcca gttgatttgg ttctaggatg 4200 

cccttgggct ccctgcgtcc aggccctctt ctbctgcctc actctcactc cagaggactg 4260 

gcagtgctgg cctccctggc aggactcccg cccctgacca gaiggggtctg ggcgtgctcc 4320 

gaggccacct gctgctcctt gcccagtgtc cctggigtgct ctgtggggac acagagaggt 4380 

cacttgggag gttcctgcct ctgccagact gcggcgbctt cctggcgtca ttgccccctg 4440 

cacgcttata caaccagtga taaggcggta tccaatgccg taatcctccc cagggccagc 4500 

ccagccctgt ccggtcccgt ccggtccctg gcctgcttgc tgggagccca gctgggggct 4560 

gcatcggggg tggtgggggt ctggagtgag gggtgcccag gcctctctca cagtgtgggg 4620 

gcacaagctc tgccctggac actctgccag tgtgacccta ccccccatgt tgggggtgag 4680 

agaagggggc tttctgggtg gccagatccc aggagagggg gcctagccag accaacccca 4740 

gcccagctct ctgcagctgt accccaaagg ccctggccca gcccagccag acagagtgtg 4800 

gacagagggg cttgggtgcg agtccaggca aggcaggaca agtctctgct gtggcttggc 4860 

agtggccccg tgactggtga catagtctgg caggcctagg ggacaaatcc tcagccicggg 4920 

tgcatcaggc cagagcctgg ctgtctggag ccctccagga aatgaccccc cgggctgggg 4980 

gacatccgag tgattgtgcc aaacaatgga caagggggcc ggagtctcag agccgaagtg 5040 

ccttccgctt ccatctgctc cgcccaggag caggtgcacc caggtggtgg cctgggctat 5100 

aaagctggcc ccctggggct tggggactca gcaccagggg ctggagggca ggggagggg 5159 

<210> 6 

<211> 175 

<212> PRT 

<213> Mus musculus 

<220> 

<221> mat_peptide 

<222> (1)..() 

<400> 6 

Met Glu Ala Arg Leu Leu Ser Asn Val Cys Gly Phe Phe Leu Val Phe 
1 5 10 15 

Leu Leu Gln Ala Glu Ser Thr Arg Val Glu Leu Val Pro Glu Lys Ile 
20 25 30 

Ala Gly Phe Trp Lys Glu Val Ala Val Ala Ser Asp Gln Lys Leu Val 
35 40 45 

Leu Lys Ala Gln Arg Arg Val Glu Gly Leu Phe Leu Thr Phe Ser Gly 
50 55 60 

Gly Asn Val Thr Val Lys Ala Val Tyr Asn Ser Ser Gly Ser Cys Val 
65 70 75 80 

Thr Glu Ser Ser Leu Gly Ser Glu Arg Asp Thr Val Gly Glu Phe Ala 
85 90 95 

Phe Pro Gly Asn Arg Glu Ile His Val Leu Asp Thr Asp Tyr Glu Arg 
100 105 110 

Tyr Thr Ile Leu Lys Leu Thr Leu Leu Trp Gln Gly Arg Asn Phe His 
115 120 125 

Val Leu Lys Tyr Phe Thr Arg Ser Leu Glu Asn Glu Asp Glu Pro Gly 
130 135 140 

Phe Trp Leu Phe Arg Glu Met Thr Ala Asp Gln Gly Leu Tyr Met Leu 
145 150 155 160 

Ala Arg His Gly Arg Cys Ala Glu Leu Leu Lys Glu Gly Leu Val 
165 170 175 

<210> 7 

<211> 27 

<212> DNA 

<213> Mus musculus 

<220> 

<221> primer_bind 

<222> (1)..(27) 

<400> 7 

cttctctggt acaagctcca ccctggt 27 



<210> 8 

<211> 24 

<212> DNA 

<213> Mus musculus 

<220> 

<221> primer_bind 

<222> (1)..(24) 

<400> 8 

tggatggata gatgcataca tgag 24 



<210> 9 

<211> 20 

<212> DNA 

<213> Mus musculus 

<220> 

<221> primer 

<222> (1)..(20) 

<400> 9 

caacggtggt atatccagtg 20 



<210> 10 

<211> 21 

<212> DNA 

<213> Mus musculus 

<220> 

<221> primer 

<222> (1)..(21) 

<400> 10 



gatgtgctcc aggctaaagt t 21 



<211> 21 

<212> DNA 

<213> Mus musculus 

<220> 

<221> primer 

<222> (1)..(21) 

<400> 
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